PCT

WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau

(51) International Patent Classification 7: C12N 9/16

A1

(11) International Publication Number: WO 00/43503

(43) International Publication Date: 27 July 2000 (27.07.00)

(21) International Application Number: PCT/DK00/00025

(22) International Filing Date: 21 January 2000 (21.01.00)

(30) Priority Data:
PA 1999 00092    22 January 1999 (22.01.99)      DK
PA 1999 01340    21 September 1999 (21.09.99)    DK



(71) Applicant: NOVO NORDISK A/S [DK/DK]; Enzyme Business
Patents, Novo Allé, DK-2880 Bagsvaerd (DK).

(72) Inventor: LEHMANN, Martin; 258 Sayre Drive, Princeton, NJ
07043 (US).



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG,
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE,
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP,
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA,
MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU,
SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG,
UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE, LS,
MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, AZ,
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE,
CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC,
NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA,
GN, GW, ML, MR, NE, SN, TD, TG).



Published 

With international search report. 
With amended claims. 



(54) Title: IMPROVED PHYTASES 
(57) Abstract 

This invention relates to improved phytases, preferably phytases of an increased thermostability, and a process of producing them. In 
particular, stabilizing amino acid mutations are introduced into a homologous protein, or the active site of a phytase is replaced in part or 
in total. The corresponding DNA sequences and methods of preparing them are also disclosed, as are methods of producing the improved 
phytases, and the use thereof. Specific variants of Aspergillus fumigatus phytase and of consensus phytases are disclosed.





Improved phytases 

Phytases are enzymes that hydrolyze phytate (myo-inositol
hexakisphosphate) to myo-inositol and inorganic phosphate. They
are known to be valuable feed additives. 

The present invention relates to improved phytases, viz.
phytases of amended characteristics, e.g. amended activity
characteristics, as compared with e.g. the phytase(s) from which
they have been derived, or with known phytases. Amended activity
characteristics means amended in at least one respect related to
phytase activity, such as (non-exclusive list): pH stability,
temperature stability, pH profile, temperature profile, specific
activity (in particular in relation to pH and temperature),
substrate specificity, substrate cleavage pattern, substrate
binding, position specificity, the velocity and level of release
of phosphate from corn, reaction rate, phytate degradation
rate, and end level of released phosphate reached.

Examples of amended activity characteristics are amended 
specific activity (e.g. increased, e.g. increased at a pH of 3, 
4, 5, or 6); amended pH or temperature profile; and/or amended 
(e.g. increased) thermostability, e.g. of an increased melting 
temperature as measured using Differential Scanning Calorimetry 
(DSC).

The present invention also relates to a process for the 
preparation of a modified protein, wherein in a first step a 
consensus sequence is determined from a number of highly 
homologous sequences according to steps a), b) and c) below:

a) at least three, preferably at least four, amino acid
sequences are aligned by any standard alignment program known in
the art;

b) at every position of the amino acid sequence alignment,
the amino acids are evaluated for their evolutionary similarity
and a consensus residue is chosen by any standard program known
in the art, whereby the minimal requirements for calculation of
a consensus residue are set in such a way that the program is
already able to determine a consensus residue if a given residue
occurs in only two of the aligned sequences. However, if there
is a subgroup of sequences among the compared amino acid
sequences that shows a much higher degree of similarity with
each other than with the remaining sequences of the alignment,
the subgroup may be represented in the calculation only by its
consensus sequence, determined in the same way as outlined in
EP 897985, or alternatively, each sequence of the subgroup is
assigned a vote weight of 1 divided by the number of sequences
in the subgroup;

c) in case no consensus amino acid at a defined position
is identified by the program, any of the amino acids, preferably
the most frequently occurring amino acid at this position, is
selected.
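
Steps a) to c) can be illustrated with a minimal Python sketch. It
is not the program "PRETTY" and does not score evolutionary
similarity: it assumes the sequences are already aligned to equal
length and simply counts weighted votes per column. The function
names, the min_votes parameter and the example sequences are
hypothetical and chosen only for illustration.

```python
from fractions import Fraction


def subgroup_weights(n_sequences, subgroup):
    """Give each subgroup member a vote weight of 1/len(subgroup); others get 1."""
    members = set(subgroup)
    share = Fraction(1, len(members)) if members else Fraction(1)
    return [share if i in members else Fraction(1) for i in range(n_sequences)]


def consensus(aligned, weights=None, min_votes=2):
    """Return (consensus string, per-position flags) for aligned sequences.

    A flag is True when the winning residue collected at least
    min_votes weighted votes (step b). Otherwise the most frequent
    residue is still reported (step c) but flagged False.
    """
    if len(aligned) < 3:
        raise ValueError("at least three aligned sequences are required")
    length = len(aligned[0])
    if any(len(sequence) != length for sequence in aligned):
        raise ValueError("aligned sequences must have equal length")
    if weights is None:
        weights = [1] * len(aligned)

    residues, clearly_defined = [], []
    for column in range(length):
        votes = {}
        for sequence, weight in zip(aligned, weights):
            residue = sequence[column]
            votes[residue] = votes.get(residue, 0) + weight
        # Highest weighted vote wins; ties are broken alphabetically
        # so that the result is deterministic.
        best = min(votes, key=lambda r: (-votes[r], r))
        residues.append(best)
        clearly_defined.append(votes[best] >= min_votes)
    return "".join(residues), clearly_defined


if __name__ == "__main__":
    toy_alignment = ["MKA", "MKA", "MRA", "MRG", "LRG"]  # made-up data
    print(consensus(toy_alignment))
    # Treat the first three sequences as one highly similar subgroup.
    print(consensus(toy_alignment, subgroup_weights(5, [0, 1, 2])))
```

In this toy alignment, down-weighting the three near-identical
sequences changes the residue chosen at the last position, which
is the effect the vote weight is meant to have. Gap characters
are treated like any other residue here, which a real
implementation would probably handle differently.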

In a second aspect of the invention, a homologous sequence
is compared with the consensus sequence, and one or more
non-consensus residues in this homologous sequence are replaced
by the corresponding consensus residues.

Preferably, only such amino acid residues are replaced in
the homologous amino acid sequence where a consensus residue can
clearly be defined by the program under moderately stringent
conditions, whereas at all positions of the alignment where no
preferred consensus amino acid can be determined under
moderately stringent conditions, the amino acids of the
homologous protein remain unchanged.

In a third aspect of the invention, the active center of
the protein of interest is determined, comprising all amino acid
residues that are involved in forming the active center, both in
the consensus sequence and in the sequence of a homologous
protein; subsequently, some or all of the divergent amino acid
residues of the homologous protein are inserted in the backbone
of the consensus sequence.

In one embodiment of this process, the program used for
the comparison of amino acids at a defined position regarding
their evolutionary similarity is the program "PRETTY".

The active center of the protein can be determined by 
using an analysis of the three-dimensional structure of the 
protein. 

An example of a homologous protein is an enzyme family; an
example of a defined protein family is the family of phytases,
e.g. of fungal origin.

For example, the amino acid sequence of the phytase can be 
changed by the introduction of at least one mutation or 
substitution chosen from 

E58A     F54Y
D69K     I73V
D197N    K94A
T214L    R101A
E222T    N153K
E267D    V158I
R291I    A203G
R329H    S205G
S364T    V217A
A379K    A227V
G404A    V234L
         P238A
         Q277E
         A287H
         A292Q
         V366I
         A396S
         E415Q
         G437A
         R451E

For interpreting these abbreviations, take the mutation E58A as
an example: subtracting 26 from the number gives the position or
residue number in the consensus phytase sequence or another
phytase sequence aligned as shown in Fig. 1 (the numbering
corresponds to the addition of a 26 amino acid signal sequence to
the sequences shown in Fig. 1). For example, in E58A, number 58
means position number 32 (58-26=32). The letter before the
number, i.e. E, represents the amino acid in the phytase to be
modified, which is replaced by the amino acid behind the number,
i.e. A.
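
The numbering rule above can be written out as a short Python
sketch. The offset of 26 comes from the text; the class and
function names are hypothetical, and the sequence passed to
apply_mutation is assumed to be numbered as in Fig. 1 (i.e.
without the signal sequence).

```python
import re
from typing import NamedTuple

SIGNAL_PEPTIDE_LENGTH = 26  # offset stated in the text


class Mutation(NamedTuple):
    original: str      # amino acid in the phytase to be modified
    number: int        # number as written in the label, e.g. 58
    replacement: str   # amino acid introduced by the mutation

    @property
    def fig1_position(self) -> int:
        """Position in a sequence numbered as in Fig. 1."""
        return self.number - SIGNAL_PEPTIDE_LENGTH


_LABEL = re.compile(r"([A-Z])(\d+)([A-Z])")


def parse_mutation(label: str) -> Mutation:
    """Parse a label such as 'E58A' into its three parts."""
    match = _LABEL.fullmatch(label)
    if match is None:
        raise ValueError(f"not a mutation label: {label!r}")
    original, number, replacement = match.groups()
    return Mutation(original, int(number), replacement)


def apply_mutation(sequence: str, mutation: Mutation) -> str:
    """Apply a mutation to a sequence numbered as in Fig. 1 (1-based)."""
    index = mutation.fig1_position - 1
    if not 0 <= index < len(sequence):
        raise ValueError("position lies outside the sequence")
    if sequence[index] != mutation.original:
        raise ValueError("sequence does not carry the expected residue")
    return sequence[:index] + mutation.replacement + sequence[index + 1:]


if __name__ == "__main__":
    m = parse_mutation("E58A")
    print(m, m.fig1_position)
```

For the label E58A the sketch reports position 32, matching the
worked example in the text.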

The above-mentioned amino acid replacements, alone and/or
in combination, have a positive effect on the protein stability.

The following sub-groups of mutations are also of interest
(i.e. phytases comprising at least one mutation selected from
any one of the following groups):

E58A, D69K, D197N, T214L, E222T, E267D, R291I, R329H,
S364T, A379K, G404A;

F54Y, I73V, K94A, R101A, N153K, V158I, A203G, S205G,
V217A, A227V, V234L, P238A, Q277E, A287H, A292Q, V366I, A396S,
E415Q, G437A, R451E;

E58A, D69K, D197N, F54Y, I73V, K94A;

T214L, E222T, E267D, R101A, N153K, V158I;

R291I, R329H, S364T, A203G, S205G, V217A;

A379K, G404A, A227V, V234L, P238A, Q277E;

A287H, A292Q, V366I, A396S, E415Q, G437A, R451E;

T214L, E222T, S364T, V158I, A203G, G404A, A227V, P238A,
A396S, G437A, R451E.

Examples of host cells are plant cells, animal cells, and 
microbial cells, e.g. prokaryotic or eukaryotic cells, such as 
bacterial, fungal or yeast cells. An example of a fungal host is 
a strain of the genus Aspergillus, and examples of yeast hosts 
are strains of Saccharomyces and strains of Hansenula.

The invention also relates to a modified protein 
obtainable or obtained by any of the processes described above. 

The invention also relates to a variant or mutein of a 
phytase such as (but not limited to) the consensus phytase-1, 
wherein, in the amino acid sequence in Figure 2, at least one
of the following replacements has been effected: Q50L, Q50T,
Q50G, Q50T-Y51N, Q50L-Y51N or Q50T-K91A.

In the third aspect mentioned above, a consensus sequence 
is determined from homologous sequences as described above; in a 
second step the active center of the protein comprising all 
amino acid residues that are involved in forming the active 
center is determined in the consensus sequence and in the 
sequence of a single homologous protein as well. The single 
homologous protein may have preferred properties like high 
specific activity or different pH dependency of enzymatic 
activity. In a third step some or all amino acid residues that 
are involved in forming the active center of the homologous 
protein are inserted into the backbone of the consensus 
sequence. The result thereof is a chimeric protein having the
active center derived from a single protein and the backbone of
the consensus sequence.

The active center of the protein can be determined e.g. by
using any analysis of the three-dimensional structure of the
protein, e.g. by homology modelling on the basis of a known
3D-structure of a known protein.

The present invention also provides consensus proteins 
obtainable or obtained by such processes, in particular proteins 
comprising at least one of the amino acid sequences shown in 
Figures 2-6, 10 or 21, or variants or muteins thereof. Examples 
of such variants are shown in Figs. 7-9. 

Such variants or muteins can be defined and prepared on 
the basis of the teachings given in European Patent Application 
number 0897010, e.g. Q50L, Q50T, Q50G, Q50L-Y51N, or Q50T-Y51N. 
These mutations are defined as above, or, alternatively, by 
reference to Fig. 2. When referring to Fig. 2, no subtraction of 
the 26 amino acid signal peptide is required (e.g. in "Q50L", at
position 50 of the amino acid sequence of Fig. 2, the amino acid
Q has been replaced by amino acid L).

A food, feed, or pharmaceutical composition comprising the 
phytases of the invention is another aspect of the invention. 

In this context, and relating to the process of the
invention, "at least three, preferably at least four amino acid
sequences of such defined protein family" means that three,
four, five, six to twelve, twenty, fifty, or even more sequences
can be used for the alignment and the comparison to create the
amino acid sequence of the consensus protein. "Sequences of a
defined protein family" means that such sequences fold into a
three-dimensional structure wherein the alpha-helices, the
beta-sheets and beta-turns are at the same position, so that such
structures are, as called by the man skilled in the art, largely
superimposable. Furthermore, these sequences characterize
proteins that show the same type of biological activity, e.g. a
defined enzyme class, e.g. the phytases. The three-dimensional
structure of one such protein is sufficient to allow the
modelling of the structure of the other homologous proteins of
such a family. An example of how this can be done is given in
Example 1. "Evolutionary similarity" in the context of the
present invention refers to a scheme which classifies amino
acids regarding their structural similarity, so that
one amino acid can be replaced by another amino acid with a
minimal influence on the overall structure, as is done e.g.
by programs known in the art, like "PRETTY". The phrase "the
degree of similarity provided by such a program... is set to less
stringent number" means in the context of the present invention
that values for the parameters which determine the degree of
similarity in the program used in the practice of the present
invention are chosen in a way to allow the program to define a
consensus amino acid for a maximum of positions of the whole
amino acid sequence; e.g. in case of the program PRETTY, a value
of 2 or 3 for the THRESHOLD and a value of 2 for the PLURALITY
can be chosen. Furthermore, "a vote weight of one divided by the
number of such sequences" means in the context of the present
invention that the sequences which define a group of sequences
with a higher degree of similarity than the other sequences used
for the determination of the consensus sequence only contribute
to such determination with a factor which is equal to one
divided by the number of all sequences of this group.

As mentioned before, should the program not allow selection of
the consensus amino acid, the most frequent amino acid is
selected; should the latter be impossible, the man skilled in the
art will select, from all the sequences used for the comparison,
an amino acid which is known in the art for its property to
improve the thermostability of proteins, as discussed e.g. by
Janecek, S. (1993), Process Biochem. 28, 435-445; Fersht, A. R.
& Serrano, L. (1993), Curr. Opin. Struct. Biol. 3, 75-83; Alber,
T. (1989), Annu. Rev. Biochem. 58, 765-798; Matthews, B. W.
(1987), Biochemistry 26, 6885-6888; or Matthews, B. W. (1991),
Curr. Opin. Struct. Biol. 1, 17-21.

The stability of an enzyme is relevant for many industrial 
applications. Therefore, a lot of attempts, more or less 
successful, have been made to improve the stability, preferably 
the thermostability of enzymes by rational or random approaches. 

Here we present an alternative way to improve the 
thermostability of a protein. 

The invention provides a process for the preparation of a 
consensus protein comprising a process to calculate an amino 
acid residue for nearly all positions of a so-called consensus 
protein and to synthesize a complete gene from this sequence 
that can be expressed in a pro- or eukaryotic expression system. 

DNA sequences of the present invention can be constructed
starting from genomic or cDNA sequences encoding the proteins,
e.g. phytases, of interest. For example, they can be constructed
by methods of in vitro mutagenesis [see e.g. Sambrook et al.,
Molecular Cloning, Cold Spring Harbor Laboratory Press, New
York]. A widely used strategy for "site-directed mutagenesis",
as originally outlined by Hutchison and Edgell [J. Virol. 8,
181 (1971)], involves the annealing of a synthetic
oligonucleotide carrying the desired nucleotide substitution to
a target region of a single-stranded DNA sequence wherein the
mutation should be introduced [for review see Smith, Annu. Rev.
Genet. 19, 423 (1985), and for improved methods, see references
2-6 in Stanssens et al., Nucl. Acids Res. 17, 4441-4454 (1989)].
Another possibility of mutating a given DNA sequence is
mutagenesis by using the polymerase chain reaction (PCR). DNA as
starting material can be isolated from the respective strains by
methods known in the art and described e.g. in Sambrook et al.
(Molecular Cloning).

For strain information, see e.g. EP 684313 or any
depository authority indicated below. Aspergillus niger [ATCC
9142], Myceliophthora thermophila [ATCC 48102], Talaromyces
thermophilus [ATCC 20186] and Aspergillus fumigatus [ATCC 34625]
have been redeposited according to the conditions of the
Budapest Treaty at the American Type Culture Collection
under the following accession numbers: ATCC 74337, ATCC 74340,
ATCC 74338 and ATCC 74339, respectively. It is, however,
understood that DNA encoding a consensus protein in accordance
with the present invention can also be prepared in a synthetic
manner as described, e.g. in EP 747483 or EP 897985, or in the
examples, by methods known in the art.

For sequence information, see e.g. EP 684313, or sequence
databases, for example Genbank (Intelligenetics,
California, USA), European Bioinformatics Institute (Hinxton
Hall, Cambridge, GB), NBRF (Georgetown University, Medical
Centre, Washington DC, USA) and Vecbase (University of
Wisconsin, Biotechnology Centre, Madison, Wisconsin, USA).

The process of the present invention can e.g. be used to 
improve the thermostability of the enzyme phytase.

Once complete DNA sequences of the present invention have
been obtained, they can be integrated into vectors by methods
known in the art and described e.g. in Sambrook et al. (s.a.) to
overexpress the encoded polypeptide in appropriate host systems.
However, a man skilled in the art knows that the DNA
sequences themselves can also be used to transform the suitable
host systems of the invention to get overexpression of the
encoded polypeptide. Appropriate host systems are for example
fungi, like Aspergilli, e.g. Aspergillus niger [ATCC 9142] or
Aspergillus ficuum [NRRL 3135], or like Trichoderma, e.g.
Trichoderma reesei; or yeasts, like Saccharomyces, e.g.
Saccharomyces cerevisiae, or Pichia, like Pichia pastoris, or
Hansenula polymorpha, e.g. H. polymorpha (DSM5215); or plants,
as described e.g. by Pen et al., Bio/Technology 11, 811-814
(1994). A man skilled in the art knows that such microorganisms
are available from depository authorities, e.g. the American
Type Culture Collection (ATCC), the Centraalbureau voor
Schimmelcultures (CBS) or the Deutsche Sammlung für
Mikroorganismen und Zellkulturen GmbH (DSM), or any other
depository authority as listed in the journal "Industrial
Property" [(1991) 1, pages 29-40]. Bacteria which can be used
are e.g. E. coli; Bacilli, e.g. Bacillus subtilis; or
Streptomyces, e.g. Streptomyces lividans (see e.g. Anne and
Mallaert in FEMS Microbiol. Lett. 114, 121 (1993)). Preferred E.
coli strains which can be used are E. coli K12 strains, e.g. M15
[described as DZ 291 by Villarejo et al. in J. Bacteriol. 120,
466-474 (1974)], HB 101 [ATCC No. 33694] or E. coli SG13009
[Gottesman et al., J. Bacteriol. 148, 265-273 (1981)].

Vectors which can be used for expression in fungi are
known in the art and described e.g. in EP 420358, or by Cullen
et al. [Bio/Technology 5, 369-376 (1987)], Ward [Molecular
Industrial Mycology, Systems and Applications for Filamentous
Fungi, Marcel Dekker, New York (1991)], Upshall et al.
[Bio/Technology 5, 1301-1304 (1987)], Gwynne et al.
[Bio/Technology 5, 71-79 (1987)], or Punt et al. [J. Biotechnol.
17, 19-34 (1991)]; and for yeasts by Sreekrishna et al. [J.
Basic Microbiol. 28, 265-278 (1988), Biochemistry 28, 4117-4125
(1989)], Hitzemann et al. [Nature 293, 717-722 (1981)] or in EP
183070, EP 183071, EP 248227, or EP 263311. Suitable vectors
which can be used for expression in E. coli are mentioned, e.g.
by Sambrook et al. [s.a.], Fiers et al. [Procd. 8th Int.
Biotechnology Symposium, Soc. Franc. de Microbiol., Paris
(Durand et al., eds.), pp. 680-697 (1988)], Bujard et al. [Meth.
Enzymol. 155, 416-433 (1987)], or Stuber et al. [Immunological
Methods, eds. Lefkovits and Pernis, Academic Press, Inc., Vol.
IV, 121-152 (1990)]. Vectors that can be used for expression in
Bacilli are known in the art and described, e.g. in EP 207459,
EP 405370, Proc. Natl. Acad. Sci. USA 81, 439 (1984) or Yansura
and Henner, Meth. Enzymol. 185, 199-228 (1990). Vectors which
can be used for expression in H. polymorpha are known in the
art and described, e.g. in Gellissen et al., Biotechnology 9,
291-295 (1991).

Either such vectors already carry regulatory elements,
e.g. promoters, or the DNA sequences of the present invention
can be engineered to contain such elements. Suitable promoter
elements which can be used are known in the art and are, e.g.
for Trichoderma reesei the cbh1- [Haarki et al., Biotechnology
7, 596-600 (1989)] or the pki1-promoter [Schindler et al., Gene
130, 271-275 (1993)]; for Aspergillus oryzae the amy-promoter
[Christensen et al., Abstr. 19th Lunteren Lectures on Molecular
Genetics F23 (1987), Christensen et al., Biotechnology 6,
1419-1422 (1988), Tada et al., Mol. Gen. Genet. 229, 301 (1991)];
and for Aspergillus niger the glaA- [Cullen et al.,
Bio/Technology 5, 369-376 (1987), Gwynne et al., Bio/Technology
5, 713-719 (1987), Ward in Molecular Industrial Mycology, Systems
and Applications for Filamentous Fungi, Marcel Dekker, New York,
83-106 (1991)], alcA- [Gwynne et al., Bio/Technology 5, 718-719
(1987)], suc1- [Boddy et al., Curr. Genet. 24, 60-66 (1993)],
aphA- [MacRae et al., Gene 71, 339-348 (1988), MacRae et al.,
Gene 132, 193-198 (1993)], tpiA- [McKnight et al., Cell 46,
143-147 (1986), Upshall et al., Bio/Technology 5, 1301-1304
(1987)], gpdA- [Punt et al., Gene 69, 49-57 (1988), Punt et al.,
J. Biotechnol. 17, 19-37 (1991)] and the pkiA-promoter [de Graaff
et al., Curr. Genet. 22, 21-27 (1992)]. Suitable promoter
elements that can be used for expression in yeast are known in
the art and are, e.g. the pho5-promoter [Vogel et al., Mol.
Cell. Biol., 2050-2057 (1989); Rudolf and Hinnen, Proc. Natl.
Acad. Sci. 84, 1340-1344 (1987)] or the gap-promoter for
expression in Saccharomyces cerevisiae; the aox1-promoter [Koutz
et al., Yeast 5, 167-177 (1989); Sreekrishna et al., J. Basic
Microbiol. 28, 265-278 (1988)] for Pichia pastoris; or the FMD
promoter [Hollenberg et al., EPA No. 0299108] or MOX-promoter
[Ledeboer et al., Nucl. Acids Res. 13, 3063-3082 (1985)] for H.
polymorpha.

Accordingly, vectors comprising DNA sequences of the
present invention, preferably for the expression of said DNA
sequences in bacteria or a fungal or a yeast host, and such
transformed bacteria or fungal or yeast hosts are also a part of
the invention.

The invention also provides a system that allows for high
expression of proteins, in particular of the phytases of the
invention, such as recombinant Hansenula strains. To achieve
that, the codons of the DNA sequence of such a protein may be
selected on the basis of a codon frequency table of the organism
used for expression, e.g. of yeast as in the present case (see
e.g. Example 1). Optionally, the codons for the signal
sequence may be selected in the manner described for the
specific case in Example 1; that means that a codon frequency
table is prepared on the basis of the codons used in the DNA
sequences which encode the amino acid sequences of the given
protein family. Then the codons for the design of the DNA
sequence of the signal sequence are selected from a codon
frequency table of the host cell used for expression, whereby
codons of comparable frequency in both tables are always used.
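
One possible reading of "codons of comparable frequency in both
tables" is sketched below in Python: for each codon of the
original sequence, the synonymous host codon whose relative
frequency is closest to that of the original codon in the protein
family table is chosen. This interpretation, the function names
and all frequency values are assumptions made for illustration;
the two tables are tiny made-up excerpts, not real codon usage
data.

```python
# Relative codon frequencies per amino acid (hypothetical numbers).
FAMILY_TABLE = {
    "M": {"ATG": 1.0},
    "K": {"AAA": 0.2, "AAG": 0.8},
    "L": {"CTG": 0.5, "TTG": 0.3, "CTC": 0.2},
}
HOST_TABLE = {
    "M": {"ATG": 1.0},
    "K": {"AAA": 0.7, "AAG": 0.3},
    "L": {"TTG": 0.45, "CTG": 0.35, "TTA": 0.2},
}


def amino_acid_of(codon, table):
    """Look up which amino acid a codon encodes according to the table."""
    for amino_acid, codons in table.items():
        if codon in codons:
            return amino_acid
    raise KeyError(f"codon {codon!r} is not in the table")


def match_codon(codon, family_table, host_table):
    """Pick the synonymous host codon of most similar relative frequency."""
    amino_acid = amino_acid_of(codon, family_table)
    target = family_table[amino_acid][codon]
    host_codons = host_table[amino_acid]
    # Ties are broken alphabetically so that the result is deterministic.
    return min(host_codons, key=lambda c: (abs(host_codons[c] - target), c))


def recode(dna, family_table, host_table):
    """Recode an in-frame DNA sequence codon by codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    return "".join(match_codon(c, family_table, host_table) for c in codons)


if __name__ == "__main__":
    print(recode("ATGAAGCTG", FAMILY_TABLE, HOST_TABLE))
```

The encoded amino acid sequence is unchanged by construction,
since each codon is only ever exchanged for a synonymous one.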

Once such DNA sequences have been expressed in an 
appropriate host cell in a suitable medium, the encoded protein 
can be isolated either from the medium in the case the protein 
is secreted into the medium or from the host organism in case 
such protein is present intracellularly by methods known in the 
art of protein purification or described in case of a phytase, 
e.g. in EP 420358. Accordingly, a process for the preparation of 
a polypeptide of the present invention wherein transformed 
bacteria or a host cell as described above are cultured under 
suitable culture conditions, and the polypeptide is recovered 
therefrom and a polypeptide when produced by such a process; or 
a polypeptide encoded by a DNA sequence of the present 
invention, are also a part of the present invention. 

Once obtained, the polypeptides of the present invention 
can be characterized regarding their properties that make them 
useful in agriculture by any assay known in the art. 

In general, the polypeptides of the present invention can 
be used without being limited to a specific field of 
application, e.g. in case of phytases for the conversion of 
inositol polyphosphates, like phytate, to inositol and inorganic
phosphate.

Furthermore, the polypeptides of the present invention can
be used in a process for the preparation of a pharmaceutical
composition or compound food or feeds wherein the components of
such a composition are mixed with at least one polypeptide of
the present invention. Accordingly, compound food or feeds or
pharmaceutical compositions comprising at least one polypeptide
of the present invention are also a part of the present
invention. A man skilled in the art is familiar with their
process of preparation. Such pharmaceutical compositions or
compound foods or feeds can further comprise additives or
components generally used for such purpose and known in the
state of the art.

The present invention also provides a process for the
reduction of levels of phytate in animal manure wherein an
animal is fed such a feed composition in an amount effective in
converting phytate contained in the feedstuff to lower inositol
phosphates and/or inositol, and inorganic phosphate.

In the present context, a phytase is an enzyme or
polypeptide that has phytase activity. A phytase can be e.g. a
myo-inositol hexakisphosphate phosphohydrolase, such as
(myo-inositol hexakisphosphate 3-phosphohydrolase, EC 3.1.3.8)
and (myo-inositol hexakisphosphate 6-phosphohydrolase,
EC 3.1.3.26).

In one embodiment, the phytase is purified, viz. at least
85%, preferably at least 86, 87, 88, 89, 90, 91, 92, 93, 94, 95,
96, 97, 98, 99% pure, as evaluated by SDS-PAGE. The phytase may
be isolated. Phytase activity can be determined using any
phytase assay known in the art, e.g. the assay described herein
(see Example 9). The assay temperature may be the optimum
temperature of the actual phytase, and the assay pH may be the
optimum pH of the actual phytase.

The assay temperature may e.g. be selected within the
range of 20-90°C, or 30-80°C, or 35-75°C, for instance
temperatures of 37°C, 50°C, 60°C, or 70°C.

The assay pH may e.g. be selected within the range of pH
2-9, or 3-8, or 3-6; for instance, assay pH values of 3, 4, 5, 6,
or 7 may be chosen.

Amino acid sequence homology (or polypeptide or amino acid
homology) is determined as the degree of identity between two
sequences. This may suitably be determined by means of computer
programs known in the art such as GAP provided in the GCG
program package [Program Manual for the Wisconsin Package,
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin
53711, USA; see also Needleman, S.B. and Wunsch, C.D. (1970),
J. Mol. Biol. 48, 443-453]. In release 9.1, for comparing
polypeptide sequences, the Length Weight is set to 0, and the
Gap Weight is set to 3.0.
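
The idea behind a degree of identity derived from a global
alignment can be sketched in Python as follows. This is a
textbook Needleman-Wunsch alignment with a simple match/mismatch
score and a linear gap penalty; it is not the GAP program, does
not use a substitution matrix, and its scoring values do not
correspond to the Gap Weight and Length Weight settings quoted
above. Identity is computed here over the full alignment length,
which is only one of several conventions in use.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align two sequences; return both aligned strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def pair(i, j):
        # Substitution score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + pair(i, j),  # align the two residues
                score[i - 1][j] + gap,             # gap in b
                score[i][j - 1] + gap,             # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal path.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Identical, non-gap columns as a percentage of alignment length."""
    if not aligned_a:
        return 0.0
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)


if __name__ == "__main__":
    aligned = needleman_wunsch("ACGT", "ACT")
    print(aligned, percent_identity(*aligned))
```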

The degree of identity or homology between two DNA
(nucleic acid) sequences may be determined by means of computer
programs known in the art such as GAP provided in the GCG
program package [Program Manual for the Wisconsin Package,
Genetics Computer Group, 575 Science Drive, Madison, Wisconsin
53711, USA; see also Needleman, S.B. and Wunsch, C.D. (1970),
J. Mol. Biol. 48, 443-453]. In release 9.1, GAP is used with
the following settings for DNA sequence comparison: GAP creation
penalty of 50 and GAP extension penalty of 3.

Suitable experimental conditions for determining whether a
given DNA or RNA sequence hybridizes to a specified nucleotide
or oligonucleotide probe involve presoaking of the filter
containing the DNA or RNA fragments to be examined for
hybridization in 5 x SSC (sodium chloride/sodium citrate; J.
Sambrook, E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning,
A Laboratory Manual, 2nd edition, Cold Spring Harbor, New York)
for 10 min, and prehybridization of the filter in a solution of
5 x SSC, 5 x Denhardt's solution, 0.5 % SDS and 100 µg/ml of
denatured sonicated salmon sperm DNA (Sambrook et al. 1989),
followed by hybridization in the same solution containing a
concentration of 10 ng/ml of a random-primed (Feinberg, A. P. and
Vogelstein, B. (1983) Anal. Biochem. 132:6-13), 32P-dCTP-labeled
(specific activity > 1 x 10^9 cpm/µg) probe for 12 hours at
approximately 45°C.

The filter is then washed twice for 30 minutes in 2 x SSC,
0.5 % SDS at at least 55°C (low stringency), at at least 60°C
(medium stringency), at at least 65°C (medium/high stringency),
at at least 70°C (high stringency), or at at least 75°C (very
high stringency).
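
The wash temperature thresholds can be collected in a small
lookup, sketched here in Python. The thresholds and labels are
taken from the paragraph above; the function name and the choice
to report the highest level whose minimum temperature is met are
assumptions of this sketch.

```python
# (minimum wash temperature in °C, stringency label), highest first.
STRINGENCY_LEVELS = [
    (75, "very high"),
    (70, "high"),
    (65, "medium/high"),
    (60, "medium"),
    (55, "low"),
]


def stringency(wash_temperature_c):
    """Return the highest stringency level met, or None below 55°C."""
    for minimum, label in STRINGENCY_LEVELS:
        if wash_temperature_c >= minimum:
            return label
    return None


if __name__ == "__main__":
    for temperature in (50, 55, 62, 70, 80):
        print(temperature, stringency(temperature))
```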

Molecules to which the oligonucleotide probe hybridizes 
under these conditions can be detected using an x-ray film. 

Phytases of amended thermostability, or thermostable 
phytases, are one aspect of the present invention. A 
"thermostable" phytase is a phytase that has a Tm (melting 
temperature) - as measured on purified phytase protein by 
Differential Scanning Calorimetry (DSC) - of at least 65°C. For 
the DSC, a constant heating rate may be used, e.g. of 10°C/min. 
In alternative embodiments, the Tm is at least 66, 67, 68, 69, 
70, 71, 72, 73, 74 or 75°C. Or, the Tm is equal to or lower than 
150°C, or equal to or lower than 145, 140, 135, 130, 125, 120, 
115 or 110°C. Accordingly, examples of intervals of Tm are:
65-150°C, 66-150°C, - (etc.) - 75-150°C; 65-145°C, 66-145°C, -
(etc.) - 75-145°C; 65-140°C, - (etc.) - 75-140°C; - (etc.) -
65-110°C, 66-110°C, - (etc.) - 75-110°C.
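The abbreviated list of intervals can be written out in full by combining every stated lower limit with every stated upper limit. The sketch below does this for Tm; the helper name is an illustrative assumption.

```python
def enumerate_intervals(lower_bounds, upper_bounds):
    """Combine each upper limit with each lower limit, in the order of the text."""
    return [(low, high) for high in upper_bounds for low in lower_bounds]


# Lower limits 65, 66, ..., 75 deg C and upper limits 150, 145, ..., 110 deg C,
# as listed in the paragraph above.
TM_LOWER = list(range(65, 76))
TM_UPPER = list(range(150, 105, -5))
TM_INTERVALS = enumerate_intervals(TM_LOWER, TM_UPPER)

if __name__ == "__main__":
    print(len(TM_INTERVALS))
    print(TM_INTERVALS[0], TM_INTERVALS[-1])
```

With eleven lower limits and nine upper limits, this yields 99 intervals, from 65-150°C to 75-110°C.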

Particular ranges for Tm are the following: between 65 and 
110°C; between 70 and 110°C; between 70 and 100°C; between 75 
and 95°C; or between 80 and 90°C.

In Examples 9 and 10 below, the measurement of Tm by DSC 
is described, and the Tm's of a number of phytases are shown. 

The optimum temperatures are also indicated, since - as an
alternative means - a thermostable phytase can be defined as a
phytase having a temperature optimum of at least 60°C.
Preferably, the optimum temperature is determined on the
substrate phytate or phytic acid at pH 5.0 or 5.5. Example 9
describes an example of a phytase assay, including a definition
of units.

In alternative embodiments, the optimum temperature is at
least 61, 62, 63, 64, 65, 66, 67, 68, 69 or 70°C. In a
particular embodiment, the optimum temperature is equal to or
lower than 140°C, or equal to or lower than 135, 130, 125, 120,
115, 110, 105 or 100°C. Accordingly, examples of intervals of
optimum temperature are: 60-140°C, 61-140°C, - (etc.) -
70-140°C; 60-135°C, 61-135°C, - (etc.) - 70-135°C; 60-130°C, -
(etc.) - 70-130°C; - (etc.) - 60-100°C, 61-100°C, - (etc.) -
70-100°C.

Before describing the present invention in more detail a 
short explanation of the Figures enclosed is given below. 

Figure 1: Design of the consensus phytase-1 sequence. The
letters represent the amino acid residues in the one-letter
code. The following sequences were used for the alignment: phyA
from Aspergillus terreus 9A-1 [Mitchell, D. B., Vogel, K.,
Weimann, B. J., Pasamontes, L. & van Loon, A. P. G. M. (1997)
The phytase subfamily of histidine acid phosphatases: isolation
of genes for two novel phytases from the fungi Aspergillus
terreus and Myceliophthora thermophila, Microbiology 143,
245-252; from amino acid (aa) 27; SEQ ID NO: 1]; phyA from A.
terreus cbs116.46 [EP 897985; A heat resistant phytase of
Aspergillus fumigatus with superior performance in animal
experiments. Phytase optimization and natural variability. In:
The Biochemistry of phytate and phytases (eds. Rasmussen, S.K.;
Raboy, V.; Dalbøge, H. and Loewus, F.; Kluwer Academic
Publishers); from aa 27; SEQ ID NO: 2]; phyA from Aspergillus
niger var. awamori (Piddington et al. (1993) Gene 133, 55-62;
from aa 27; SEQ ID NO: 3); phyA from A. niger T213 (EP 897985;
from aa 27; SEQ ID NO: 4); phyA from A. niger strain NRRL3135
[van Hartingsveldt, W., van Zeijl, C. M. F., Harteveld, G. M.,
Gouka, R. J., Suykerbuyk, M. E. G., Luiten, R. G. M., van
Paridon, P. A., Selten, G. C. M., Veenstra, A. E., van Gorcom,
R. F. M., & van den Hondel, C. A. M. J. J. (1993) Cloning,
characterization and overexpression of the phytase-encoding gene
(phyA) of Aspergillus niger. Gene 127, 87-94; from aa 27; SEQ ID
NO: 5]; phyA from Aspergillus fumigatus ATCC 13073 [Pasamontes,
L., Haiker, M., Wyss, M., Tessier, M. & van Loon, A. P. G. M.
(1997) Cloning, purification and characterization of a heat
stable phytase from the fungus Aspergillus fumigatus, Appl.
Environ. Microbiol. 63, 1696-1700; from aa 25; SEQ ID NO: 6];
phyA from A. fumigatus ATCC 32722 (EP 897985; from aa 27; SEQ
ID NO: 7); phyA from A. fumigatus ATCC 58128 (EP 897985; from
aa 27; SEQ ID NO: 8); phyA from A. fumigatus ATCC 26906
(EP 897985; from aa 27; SEQ ID NO: 9); phyA from A. fumigatus
ATCC 32239 (EP 897985; from aa 30; SEQ ID NO: 10); phyA from
Emericella nidulans [Pasamontes, L., Haiker, M.,
Henriquez-Huecas, M., Mitchell, D. B. & van Loon, A. P. G. M. (1997a)
Cloning of the phytases from Emericella nidulans and the
thermophilic fungus Talaromyces thermophilus. Biochim. Biophys.
Acta 1353, 217-223; from aa 25; SEQ ID NO: 11]; phyA from
Talaromyces thermophilus (Pasamontes et al., 1997a; from aa 24;
SEQ ID NO: 12); and phyA from Myceliophthora thermophila
(Mitchell et al., 1997; from aa 19; SEQ ID NO: 13). The
alignment was calculated using the program PILEUP. The location
of the gaps was refined by hand. Capitalized amino acid residues
in the alignment at a given position belong to the amino acid
coalition that establishes the consensus residue. In bold, beneath
the calculated consensus sequence, the amino acid sequence of
the finally constructed consensus phytase (Fcp) is shown (SEQ ID
NO: 14). The gaps in the calculated consensus sequence were
filled by hand according to the principles stated in Example 1.

Figure 2: DNA sequence (SEQ ID NO: 15) of the consensus
phytase-1 gene (fcp) and of the primers used for the gene
construction. The calculated amino acid sequence (Figure 1, SEQ ID
NO: 14) was converted into a DNA sequence using the program
BACKTRANSLATE [Devereux, J., Haeberli, P. & Smithies, O. (1984)
A comprehensive set of sequence analysis programs for the VAX.
Nucl. Acids Res. 12, 387-395], and the codon frequency table of
highly expressed yeast genes (GCG program package, 9.0). The
signal peptide of the phytase from A. terreus cbs116.46 was
fused to the N-terminus. The amino acid sequence shown in Fig. 2
is SEQ ID NO: 16. The bold bases represent the sequences of the
oligonucleotides used to generate the gene. The names of the
respective oligonucleotides are alternately noted above or below
the sequence. The underlined bases represent the start and stop
codon of the gene. The bases written in italics represent the
two introduced Eco RI sites.



Figure 3: Alignment and consensus sequence of five
Basidiomycete phytases. The letters represent the amino acid
residues in the one-letter code. The amino acid sequences of the
phytases from Paxillus involutus, phyA1 (from aa 21; SEQ ID NO:
17) and phyA2 (from aa 21, WO 98/28409; SEQ ID NO: 18); Trametes
pubescens (from aa 24, WO 98/28409; SEQ ID NO: 19); Agrocybe
pediades (from aa 19, WO 98/28409; SEQ ID NO: 20); and
Peniophora lycii (from aa 21, WO 98/28409; SEQ ID NO: 21),
starting with the amino acid residues mentioned in parentheses,
were used for the alignment and the calculation of the
corresponding consensus sequence called "Basidio" (Example 2;
SEQ ID NO: 22). The alignment was performed with the program
PILEUP. The location of the gaps was refined by hand. The
consensus sequence was calculated by the program PRETTY. While a
vote weight of 0.5 was assigned to the two P. involutus
phytases, all other genes were used with a vote weight of 1.0
for the consensus sequence calculation. At positions where the
program was not able to determine a consensus residue, the
Basidio sequence contains a dash. Capitalized amino acid
residues in the alignment at a given position represent the
amino acid coalition that established the consensus residue.

Figure 4: Design of consensus phytase-10 amino acid
sequence. By adding the sequence of Thermomyces lanuginosus
phytase [Berka, R. M., Rey, M. W., Brown, K. M., Byun, T. &
Klotz, A. V. (1998) Molecular characterization and expression of
a phytase gene from the thermophilic fungus Thermomyces
lanuginosus. Appl. Environ. Microbiol. 64, 4423-4427; SEQ ID NO:
23] and the consensus sequence of the phytases from five
Basidiomycetes (SEQ ID NO: 22) to the alignment of Figure 1, an
improved consensus sequence was calculated by the program
PRETTY. Additionally, the amino acid sequence of A. niger T213
was omitted, and a vote weight of 0.5 was assigned to the
remaining two A. niger phytase sequences. For further
information see Example 2.

Figure 5: DNA and amino acid sequence of consensus
phytase-10 (SEQ ID NO: 25, and SEQ ID NO: 26, respectively). The
amino acid sequence is written above the corresponding DNA
sequence using the one-letter code. The sequences of the
oligonucleotides that were used to assemble the gene are in
bold letters. The names of the respective oligonucleotides and
the amino acids that differ relative to consensus phytase-1 are
underlined. The fcp10 gene was assembled from the following
oligonucleotides: CP-1, CP-2, CP-3.10, CP-4.10, CP-5.10, CP-6,
CP-7.10, CP-8.10, CP-9.10, CP-10.10, CP-11.10, CP-12.10,
CP-13.10, CP-14.10, CP-15.10, CP-16.10, CP-17.10, CP-18.10,
CP-19.10, CP-20.10, CP-21.10, and CP-22.10. The newly synthesized
oligonucleotides are additionally marked by the number 10. The
phytase contains the following 32 exchanges relative to
consensus phytase-1: Y54F, E58A, D69K, D70G, A94K, N134Q, I158V,
S187A, Q188N, D197N, S204A, T214L, D220E, L234V, A238P, D246H,
T251N, Y259N, E267D, E277Q, A283D, R291I, A320V, R329H, S364T,
I366V, A379K, S396A, G404A, Q415E, A437G, A463E. The underlined
mutations revealed a stabilizing effect on consensus phytase-1
when tested as single mutations in consensus phytase-1.

Figure 6: Alignment for the design of consensus phytase-11
(SEQ ID NO: 27). In contrast to the design of consensus
phytase-10, for the design of the amino acid sequence of consensus
phytase-11, all Basidiomycete phytases were used as independent
sequences using an assigned vote weight of 0.2 for each
Basidiomycete sequence. Additionally, the amino acid sequence of
A. niger T213 was again used in this alignment.

Figure 7: DNA and amino acid sequence of consensus
phytase-1-thermo[8]-Q50T-K91A (SEQ ID NO: 28, and SEQ ID NO: 29,
respectively). The amino acid sequence is written above the
corresponding DNA sequence using the one-letter code. The
replaced amino acid residues (relative to consensus phytase-1)
are underlined. The stop codon of the gene is marked by a star
(*).

Figure 8: DNA and amino acid sequence of consensus
phytase-10-thermo[3]-Q50T-K91A (SEQ ID NO: 30, and SEQ ID NO:
31, respectively). The amino acid sequence is written above the
corresponding DNA sequence using the one-letter code. The
replaced amino acid residues (relative to consensus phytase-10)
are underlined. The stop codon of the gene is marked by a star
(*).

Figure 9: DNA and amino acid sequence of A. fumigatus ATCC
13073 phytase alpha-mutant Q51T (SEQ ID NO: 32, and SEQ ID NO:
33, respectively). The amino acid sequence is written above the
corresponding DNA sequence using the one-letter code. The
replaced amino acid residues (relative to A. fumigatus ATCC
13073 phytase) are underlined. The stop codon of the gene is
marked by a star (*).

Figure 10: DNA and amino acid sequence of consensus
phytase-7 (SEQ ID NO: 34, and SEQ ID NO: 35, respectively). The
amino acids are written above the corresponding DNA sequence
using the one-letter code. The sequences of the oligonucleotides
used to assemble the gene are in bold letters. Oligonucleotides
and amino acids that were exchanged (relative to consensus
phytase-1) are underlined and the corresponding triplets are
written in lower case letters. The fcp7 gene was assembled from
the following oligonucleotides: CP-1, CP-2, CP-3, CP-4.7,
CP-5.7, CP-6, CP-7, CP-8.7, CP-9, CP-10.7, CP-11.7, CP-12.7,
CP-13.7, CP-14.7, CP-15.7, CP-16, CP-17.7, CP-18.7, CP-19.7, CP-20,
CP-21, and CP-22. The newly synthesized oligonucleotides are
additionally marked by the number 7. Consensus phytase-7
contains the following 24 exchanges in comparison to the
original consensus phytase-1: S89D, S92G, A94K, D164S, P201S,
G203A, G205S, H212P, G224A, D226T, E255T, D256E, V258T, P265S,
Q292H, G300K, Y305H, A314T, S364G, M365I, A397S, S398A, G404A,
and A405S.

Figure 11: Differential scanning calorimetry (DSC) of
consensus phytase-1 and consensus phytase-10. The protein
samples were concentrated to about 50-60 mg/ml and extensively
dialyzed against 10 mM sodium acetate, pH 5.0. A constant
heating rate of 10°C/min was applied up to 95°C. DSC of
consensus phytase-10 (upper graph) yielded a melting temperature
of 85.4°C, which is 7.3°C higher than the melting point of
consensus phytase-1 (78.1°C, lower graph).

Figure 12: Differential scanning calorimetry (DSC) of
consensus phytase-10-thermo[3]-Q50T and consensus
phytase-10-thermo[3]-Q50T-K91A. The protein samples were concentrated to
ca. 50-60 mg/ml and extensively dialyzed against 10 mM sodium
acetate, pH 5.0. A constant heating rate of 10°C/min was applied
up to 95°C. DSC of consensus phytase-10-thermo[3]-Q50T (upper
graph) yielded a melting temperature of 88.6°C, while the
melting temperature of consensus phytase-10-thermo[3]-Q50T-K91A was
determined to be 89.3°C.

Figure 13: Comparison of the temperature optimum between
consensus phytase-1, consensus phytase-10 and consensus
phytase-10-thermo[3]-Q50T. For the determination of the temperature
optimum, the phytase standard assay of Example 9 was performed
at a series of temperatures between 37 and 86°C. The diluted
supernatant of transformed S. cerevisiae strains was used for
the determination. The other components of the supernatant had
no influence on the determination of the temperature optimum: a,
consensus phytase-1; consensus phytase-10; ■, consensus
phytase-10-thermo[3]-Q50T.

Figure 14: pH-dependent activity profile and substrate
specificity of consensus phytase-10 and its variants
thermo[3]-Q50T and thermo[3]-Q50T-K91A. The phytase activity was
determined using the standard assay in appropriate buffers (see
Example 9) at different pH-values. Graph a) shows the
pH-dependent activity profile of consensus phytase-10 (□),
consensus phytase-10-thermo[3]-Q50T (.), and consensus
phytase-10-thermo[3]-Q50T-K91A (a). Graph b) shows the corresponding
substrate specificity tested by replacement of phytate in the
standard assay by the indicated compounds (open bars, consensus
phytase-10; grey bars, consensus phytase-10-thermo[3]-Q50T; dark
bars, consensus phytase-10-thermo[3]-Q50T-K91A). The numbers
correspond to the following substrates: 1, phytate; 2,
p-nitrophenyl phosphate; 3, phenyl phosphate; 4,
fructose-1,6-bisphosphate; 5, fructose-6-phosphate; 6,
glucose-6-phosphate; 7, ribose-5-phosphate; 8,
DL-glycerol-3-phosphate; 9, glycerol-2-phosphate; 10,
3-phosphoglycerate; 11, phosphoenolpyruvate; 12, AMP; 13, ADP;
14, ATP.

Figure 15: pH-dependent activity profile and substrate
specificity of consensus phytase-1-thermo[8]-Q50T and of
consensus phytase-1-thermo[8]-Q50T-K91A. The phytase activity
was determined using the standard assay in appropriate buffers
(see Example 9) at different pH-values. Graph a) shows the
pH-dependent activity profile of the Q50T- (■) and the Q50T-K91A-
variant (.). Graph b) shows the corresponding substrate
specificities tested by replacement of phytate in the standard
assay by the indicated compounds (open bars, consensus
phytase-1-thermo[8]-Q50T; filled bars, consensus
phytase-1-thermo[8]-Q50T-K91A). The substrates are listed in the
legend of Figure 14.

Figure 16: Differential scanning calorimetry (DSC) of
consensus phytase-1-thermo[8]-Q50T and consensus
phytase-1-thermo[8]-Q50T-K91A. The protein samples were concentrated to
ca. 50-60 mg/ml and extensively dialyzed against 10 mM sodium
acetate, pH 5.0. A constant heating rate of 10°C/min was applied
up to 95°C. DSC of consensus phytase-1-thermo[8]-Q50T (upper
graph) showed a melting temperature of 84.7°C, while the melting
point of consensus phytase-1-thermo[8]-Q50T-K91A was found at
85.7°C.

Figure 17: Comparison of the temperature optimum between
consensus phytase-1, consensus phytase-1-thermo[3] and consensus
phytase-1-thermo[8]. For the determination of the temperature
optimum, the phytase standard assay was performed at a series of
temperatures between 37 and 86°C. Protein purified from the
supernatant of transformed S. cerevisiae strains was used for
the determination. O, consensus phytase-1; □, consensus
phytase-1-thermo[3]; ^, consensus phytase-1-thermo[8].

Figure 18: Comparison of the pH-dependent activity profile
and substrate specificity between consensus phytase-1, consensus
phytase-7, and the phytase from A. niger NRRL 3135. The phytase
activity was determined using the standard assay in appropriate
buffers (see Example 9) at different pH-values. Graph a) shows
the pH-dependent activity profile of consensus phytase-1 (■),
the phytase from A. niger NRRL 3135 (O), and of consensus
phytase-7 (^). Graph b) shows the corresponding substrate
specificities tested by replacement of phytate in the standard
assay by the indicated compounds (black bars, A. niger NRRL 3135
phytase; open bars, consensus phytase-1; dashed bars, consensus
phytase-7). The substrates are listed in the legend of Figure
14.

Figure 19 : Differential scanning calorimetry (DSC) of the 
phytase from A. fumigatus ATCC 13073 and of its stabilized 
alpha-mutant, which contains the following amino acid exchanges: 
F55Y, V100I, F114Y, A243L, S265P, and N294D. 

The protein samples were concentrated to ca. 50-60 mg/ml 
and extensively dialyzed against 10 mM sodium acetate, pH 5.0. A 
constant heating rate of 10°C/min was applied up to 95°C. DSC of 
A. fumigatus 13073 phytase (lower graph) revealed a melting 
temperature of 62.5°C, while the melting point of the alpha- 
mutant was found at 67.0°C. 

Figure 20: Comparison of the temperature optima of A.
fumigatus 13073 wild-type phytase, its alpha-mutant, and a
further stabilized alpha-mutant (E59A-S154N-R329H-S364T-G404A).
For the determination of the temperature optimum, the phytase
standard assay was performed at a series of temperatures between
37 and 75°C. The diluted supernatant of transformed S.
cerevisiae strains was used for the determination. The other
components of the supernatant had no influence on the
determination of the temperature optimum. O, A. fumigatus ATCC
13073 phytase; ^, A. fumigatus ATCC 13073 alpha-mutant; □, A.
fumigatus ATCC 13073
alpha-mutant-(E59A-S154N-R329H-S364T-G404A)-Q27T; ■, A. fumigatus
ATCC 13073
alpha-mutant-(E59A-S154N-R329H-S364T-G404A)-Q51T-K92A. Q51T and
K92A correspond to consensus phytase-1 substitutions Q50T and
K91A, respectively.

Figure 21: Amino acid sequence of consensus phytase-12
(consphy12; SEQ ID NO: 36) which contains a number of active
site residues transferred from the "basidio" consensus sequence
to consensus phytase-10-thermo[3]-Q50T-K91A (underlined).

Figure 22: DNA and amino acid sequence of consensus
phytase-3-thermo[11]-Q50T. The amino acids are written below the
corresponding DNA sequence using the one-letter code.

Figure 23: DNA and amino acid sequence of consensus
phytase-3-thermo[11]-Q50T-K91A. The amino acids are written
below the corresponding DNA sequence using the one-letter code.

Figure 24: DNA and amino acid sequence of consensus
phytase-10-thermo[5]-Q50T. The amino acids are written below the
corresponding DNA sequence using the one-letter code.

Figure 25: DNA and amino acid sequence of consensus
phytase-10-thermo[5]-Q50T-K91A. The amino acids are written
below the corresponding DNA sequence using the one-letter code.

The phytase-producing microorganism strains mentioned
herein, viz. Paxillus involutus CBS 100231; Peniophora lycii CBS
686.96; Agrocybe pediades CBS 900.96; and Trametes pubescens CBS
100232; were isolated from natural samples originating from,
respectively, Denmark; Denmark; Denmark; and Sweden (the Uppsala
collection). The samples were collected in November 1992; October
1993; June 1995; and in November 1995, respectively.

Example 1
Consensus phytase-1

The amino acid sequence of consensus phytase-1 (fungal
consensus phytase, fcp) was designed and calculated as described
in Examples 1 and 2 of EP 897985. Table 1 below shows the origin
and vote weight of the phytase amino acid sequences used for the
design of consensus phytase-1. The consensus phytase-1 sequence
was furthermore converted into a DNA sequence as described in
Example 3 of EP 897985, and the consensus phytase-1 gene was
constructed and cloned as described in Example 4 of EP 897985.

Table 1 

Origin and vote weight of the phytase amino acid sequences 

- phyA from Aspergillus terreus 9A-1, aa 27, vote weight 0.5
(Mitchell et al., 1997)

- phyA from Aspergillus terreus cbs116.46, aa 27, vote weight
0.5 (EP 897985)

- phyA from Aspergillus niger var. awamori, aa 27, vote weight
0.33 [Piddington, C. S., Houston, C. S., Paloheimo, M.,
Cantrell, M., Miettinen-Oinonen, A., Nevalainen, H., & Rambosek,
J. (1993) The cloning and sequencing of the genes encoding
phytase (phy) and pH 2.5-optimum acid phosphatase (aph) from
Aspergillus niger var. awamori. Gene 133, 55-62]

- phyA from Aspergillus niger T213, aa 27, vote weight 0.33
(EP 897985)

- phyA from Aspergillus niger strain NRRL3135, aa 27, vote
weight 0.33 (van Hartingsveldt et al., 1993)

- phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote
weight 0.2 (Pasamontes et al., 1997)

- phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote
weight 0.2 (EP 897985)

- phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight
0.2 (EP 897985)

- phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote
weight 0.2 (EP 897985)

- phyA from Aspergillus fumigatus ATCC 32239, aa 30, vote weight
0.2 (EP 897985)

- phyA from Emericella nidulans, aa 25, vote weight 1.0
(Pasamontes et al., 1997a)

- phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote
weight 1.0 (Pasamontes et al., 1997a)

- phyA from Myceliophthora thermophila, aa 19, vote weight 1.0
(Mitchell et al., 1997)

Example 2

Design of an improved consensus phytase (consensus
phytase-10) amino acid sequence

The alignments used for the design of consensus phytase-10
were calculated using the program PILEUP from the GCG Sequence
Analysis Package Release 9.0 (Devereux et al., 1984) with the
standard parameters (gap creation penalty 12, gap extension
penalty 4). The location of the gaps was refined using a text
editor.

The following sequences were used for the alignment of the
Basidiomycete phytases starting with the amino acid (aa)
mentioned in Table 2:

Table 2

Origin and vote weight of five Basidiomycete phytases used for
the calculation of the corresponding consensus amino acid
sequence (basidio)

- phyA1 from Paxillus involutus CBS No. 100231, aa 21, vote
weight 0.5 (WO 98/28409)

- phyA2 from Paxillus involutus CBS No. 100231, aa 21, vote
weight 0.5 (WO 98/28409)

- phyA from Trametes pubescens CBS No. 100232, aa 24, vote
weight 1.0 (WO 98/28409)

- phyA from Agrocybe pediades CBS No. 900.96, aa 19, vote weight
1.0 (WO 98/28409)

- phyA from Peniophora lycii CBS No. 686.96, aa 21, vote weight
1.0 (WO 98/28409)

The alignment is shown in Figure 3. 

In Table 3 the genes that were used for the final 
alignment are listed. The first amino acid (aa) of the sequence 
that is used in the alignment is mentioned behind the organism's 
designation. 

Table 3 

Origin and vote weight of the phytase sequences used for the 
design of consensus phytase-10 

- phyA from Aspergillus terreus 9A-1, aa 27, vote weight 0.5
(Mitchell et al., 1997)

- phyA from Aspergillus terreus cbs116.46, aa 27, vote weight
0.5 (EP 897985)

- phyA from Aspergillus niger var. awamori, aa 27, vote weight
0.5 (Piddington et al., 1993)

- phyA from Aspergillus niger strain NRRL3135, aa 27, vote
weight 0.5 (van Hartingsveldt et al., 1993)

- phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote
weight 0.2 (Pasamontes et al., 1997)

- phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote
weight 0.2 (EP 897985)

- phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight
0.2 (EP 897985)

- phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote weight
0.2 (EP 897985)

- phyA from Aspergillus fumigatus ATCC 32239, aa 30, vote weight
0.2 (EP 897985)

- phyA from Emericella nidulans, aa 25, vote weight 1.0
(Pasamontes et al., 1997a)

- phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote
weight 1.0 (Pasamontes et al., 1997a)

- phyA from Myceliophthora thermophila, aa 19, vote weight 1.0
(Mitchell et al., 1997)

- phyA from Thermomyces lanuginosus, aa 36, vote weight 1.0
(Berka et al., 1998)

- Consensus sequence of five Basidiomycete phytases, vote weight
1.0 (Basidio, Figure 3)

The corresponding alignment is shown in Figure 4. 

Calculation of the amino acid sequence of consensus
phytase-10

To improve the alignment, we combined the original consensus
sequence of five phytases from four different Basidiomycetes
(called Basidio; still containing the undefined sequence
positions; see Figure 3), nearly all phytase sequences used for
the calculation of the original consensus phytase sequences and
one new phytase sequence from the Ascomycete Thermomyces
lanuginosus into a larger alignment.

We set plurality to 2.0 and threshold to 3. The vote
weights used are listed in Table 3. The alignment and the
corresponding consensus sequence are presented in Figure 4. The
new consensus phytase sequence differs from the original
consensus phytase-1 at 32 amino acid positions. Positions for
which the program PRETTY was not able to calculate a consensus
amino acid residue were filled according to rules mentioned in
Example 1. None of the residues suggested by the program was
replaced.
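The effect of the vote weights can be illustrated with a deliberately simplified weighted vote over one alignment column. This is only a sketch: it counts identical residues and does not reproduce the plurality and threshold logic or the comparison matrix of the program PRETTY. The function name and the `min_share` parameter are hypothetical.

```python
from collections import defaultdict


def weighted_consensus_column(residues, weights, min_share=0.5):
    """Pick the residue with the largest summed vote weight in one column.

    Gaps ('-') do not vote. 'X' is returned when no residue reaches
    min_share of the total weight (an illustrative cut-off, not PRETTY's).
    """
    totals = defaultdict(float)
    for residue, weight in zip(residues, weights):
        if residue != "-":
            totals[residue] += weight
    if not totals:
        return "X"
    best = max(totals, key=totals.get)
    share = totals[best] / sum(weights)
    return best if share >= min_share else "X"


if __name__ == "__main__":
    # Two sequences at weight 1.0 outvote three closely related ones at 0.2.
    print(weighted_consensus_column("AASSS", [1.0, 1.0, 0.2, 0.2, 0.2]))
```

Down-weighting closely related sequences in this way keeps a well-sampled species from dominating the vote at a position.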

Furthermore, in another calculation, we included all
Basidiomycete phytases as single amino acid sequences, but
assigned each a vote weight of 0.2 in the calculation. The
corresponding alignment is shown in Figure 6. The calculated
consensus amino acid sequence (consensus phytase-11) has the
following differences to the sequence of consensus phytase-10.
Letter X means that the program was not able to calculate a
consensus amino acid; the amino acid in parenthesis corresponds
to the amino acid finally included into consensus phytase-10.

D35X (first letter for consensus phytase-10, last letter
for consensus phytase-11), X(K)69K, X(E)100E, A101R, Q134N,
X(K)153N, X(H)190H, X(A)204S, X(E)220D, E222T, V227A, X(R)271R,
H287A, X(D)288D, X(K)379K, X(I)389I, E390X, X(E)415E, X(A)416A,
X(R)446L, E463A. The numbering is as in Fig. 5.

We also checked single amino acid replacements suggested
by the improved consensus sequences 10 and 11 for their influence
on the stability of the original consensus phytase-1. The
approach is described in Example 3.

Conversion of the consensus phytase-10 amino acid sequence
into a DNA sequence

The first 26 amino acid residues of A. terreus cbs116.46
phytase were used as signal peptide and fused to the N-terminus
of consensus phytase-10. The procedure used is further described
in Example 1.

The resulting sequence of the fcp10 gene is shown in
Figure 5.

Construction and cloning of the consensus phytase-10 gene
(fcp10)

The calculated DNA sequence of fcp10 was divided into
oligonucleotides of 85 bp, alternately using the sequence of the
sense and the anti-sense strand. Every oligonucleotide overlaps
20 bp with the previous and the following oligonucleotide of the
opposite strand. The location of all primers, purchased from
Microsynth, Balgach (Switzerland) and obtained in a
PAGE-purified form, is indicated in Figure 5.
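The division scheme described above (85-nucleotide oligonucleotides, 20 bp overlap, alternating strands) may be sketched as follows. The function names are illustrative assumptions, and the sketch does not reproduce the actual oligonucleotide boundaries shown in Figure 5.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    complement = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(complement[base] for base in reversed(seq))


def tile_oligos(gene, length=85, overlap=20):
    """Split a gene into overlapping oligos, alternating sense and antisense.

    Returns (start, strand, sequence) tuples; the last oligo may be shorter
    than `length` if the gene does not divide evenly.
    """
    step = length - overlap
    oligos = []
    start = 0
    index = 0
    while True:
        fragment = gene[start:start + length]
        if index % 2 == 0:
            oligos.append((start, "sense", fragment))
        else:
            oligos.append((start, "antisense", reverse_complement(fragment)))
        if start + length >= len(gene):
            break
        start += step
        index += 1
    return oligos


if __name__ == "__main__":
    demo_gene = "ACGT" * 100  # hypothetical 400-nt sequence, for illustration only
    for start, strand, seq in tile_oligos(demo_gene):
        print(start, strand, len(seq))
```

If every oligonucleotide were exactly 85 nucleotides long, 22 oligonucleotides tiled in this way would span 85 + 21 x 65 = 1450 nucleotides.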

PCR-Reactions

In three PCR reactions, the synthesized oligonucleotides
were assembled into the entire gene. For the PCR, the High Fidelity
Kit from Boehringer Mannheim (Boehringer Mannheim, Mannheim,
Germany) and the thermo cycler "The Protokol(TM)" from AMS
Biotechnology (Europe) Ltd. (Lugano, Switzerland) were used. The
following oligonucleotides were used in a concentration of 0.2
pmol/ml.

Mix 1.10: CP-1, CP-2, CP-3.10, CP-4.10, CP-5.10, CP-6,
          CP-7.10, CP-8.10, CP-9.10, CP-10.10

Mix 2.10: CP-9.10, CP-11.10, CP-12.10, CP-13.10, CP-14.10,
          CP-15.10, CP-16.10, CP-17.10, CP-18.10, CP-19.10,
          CP-20.10, CP-21.10, CP-22.10

The newly synthesized oligonucleotides are marked by the
number 10. Consensus phytase-10 contains the following 32
exchanges, which are underlined in Figure 5, in comparison to
the original consensus phytase-1: Y54F, E58A, D69K, D70G, A94K,
N134Q, I158V, S187A, Q188N, D197N, S204A, T214L, D220E, L234V,
A238P, D246H, T251N, Y259N, E267D, E277Q, A283D, R291I, A320V,
R329H, S364T, I366V, A379K, S396A, G404A, Q415E, A437G, A463E.
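The exchange notation used above (original residue, position, new residue) can be handled programmatically. The following sketch parses the listed exchanges and applies such exchanges to a sequence. The function names are illustrative, and the sketch assumes that the position equals the 1-based index in the sequence supplied, which need not hold for the numbering of Figure 5.

```python
import re

# The 32 exchanges of consensus phytase-10 relative to consensus phytase-1,
# as listed in the text.
EXCHANGES = (
    "Y54F, E58A, D69K, D70G, A94K, N134Q, I158V, S187A, Q188N, D197N, "
    "S204A, T214L, D220E, L234V, A238P, D246H, T251N, Y259N, E267D, E277Q, "
    "A283D, R291I, A320V, R329H, S364T, I366V, A379K, S396A, G404A, Q415E, "
    "A437G, A463E"
)


def parse_exchange(code):
    """Split a code such as 'Y54F' into (old residue, position, new residue)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
    if match is None:
        raise ValueError(f"not a valid exchange: {code!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_exchanges(sequence, exchanges):
    """Apply exchanges to a sequence, checking the original residue first."""
    residues = list(sequence)
    for old, position, new in exchanges:
        if residues[position - 1] != old:
            raise ValueError(f"expected {old} at position {position}")
        residues[position - 1] = new
    return "".join(residues)


PARSED = [parse_exchange(code.strip()) for code in EXCHANGES.split(",")]

if __name__ == "__main__":
    print(len(PARSED))
    # Hypothetical three-residue sequence, for illustration only.
    print(apply_exchanges("MKY", [("Y", 3, "F")]))
```

Checking the original residue before each replacement guards against applying an exchange list to a sequence with a different numbering.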

Four short PCR primers were used for the assembly of the
oligonucleotides:

CP-a: Eco RI
5'-TATATGAATTCATGGGCGTGTTCGTC-3' (SEQ ID NO: 37)

CP-b:
5'-TGAAAAGTTCATTGAAGGTTTC-3' (SEQ ID NO: 38)

CP-c.10:
5'-TCTTCGAAAGCAGTACACAAAC-3' (SEQ ID NO: 39)

CP-e: Eco RI
5'-TATATGAATTCTTAAGCGAAAC-3' (SEQ ID NO: 40)
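That only the two outer primers carry an Eco RI recognition site (GAATTC) can be checked with a short search over the four primer sequences given above. The helper name is an illustrative assumption.

```python
ECORI_SITE = "GAATTC"  # Eco RI recognition sequence (palindromic)

# Primer sequences as given in the text (SEQ ID NO: 37 to 40).
PRIMERS = {
    "CP-a": "TATATGAATTCATGGGCGTGTTCGTC",
    "CP-b": "TGAAAAGTTCATTGAAGGTTTC",
    "CP-c.10": "TCTTCGAAAGCAGTACACAAAC",
    "CP-e": "TATATGAATTCTTAAGCGAAAC",
}


def site_positions(sequence, site=ECORI_SITE):
    """Return all 0-based start positions of `site` in `sequence`."""
    width = len(site)
    return [
        i for i in range(len(sequence) - width + 1)
        if sequence[i:i + width] == site
    ]


if __name__ == "__main__":
    for name, sequence in PRIMERS.items():
        print(name, site_positions(sequence))
```

Because the recognition sequence is palindromic, searching one strand is sufficient.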
PCR reaction a:  10 µl     Mix 1.10 (2.0 pmol of each
                           oligonucleotide)
                 2 µl      nucleotides (10 mM of each nucleotide)
                 2 µl      primer CP-a (10 pmol/ml)
                 2 µl      primer CP-c.10 (10 pmol/ml)
                 10.0 µl   PCR buffer
                 0.75 µl   polymerase mixture (2.6 U)
                 73.25 µl  H2O

PCR reaction b:  10 µl     Mix 2.10 (2.0 pmol of each
                           oligonucleotide)
                 2 µl      nucleotides (10 mM of each nucleotide)
                 2 µl      primer CP-b (10 pmol/ml)
                 2 µl      primer CP-e (10 pmol/ml)
                 10.0 µl   PCR buffer
                 0.75 µl   polymerase mixture (2.6 U)
                 73.25 µl  H2O
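As a plausibility check, the component volumes listed for PCR reaction a can be summed; they add up to 100 µl. The sketch below performs this check and shows how the water volume follows from the other components. The function names and the 100 µl target are illustrative assumptions derived from the listed volumes.

```python
# Component volumes in microliters, as listed for PCR reaction a.
REACTION_A = [
    ("Mix 1.10", 10.0),
    ("nucleotides", 2.0),
    ("primer CP-a", 2.0),
    ("primer CP-c.10", 2.0),
    ("PCR buffer", 10.0),
    ("polymerase mixture", 0.75),
    ("H2O", 73.25),
]


def total_volume_ul(components):
    """Sum the volumes of all listed components."""
    return sum(volume for _, volume in components)


def water_volume_ul(components, final_volume_ul=100.0):
    """Water needed to bring all non-water components to the final volume."""
    others = sum(volume for name, volume in components if name != "H2O")
    return final_volume_ul - others


if __name__ == "__main__":
    print(total_volume_ul(REACTION_A))
    print(water_volume_ul(REACTION_A))
```

The same check applies to reaction b, which uses identical volumes with a different oligonucleotide mix and primer pair.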

Reaction conditions for PCR reactions a and b: 



step 1  2 min - 45°C

step 2  30 sec - 72°C

step 3  30 sec - 94°C

step 4  30 sec - 52°C

step 5  1 min - 72°C



Steps 3 to 5 were repeated 40-times. 
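The cycling program for reactions a and b can be represented as data, from which the nominal hold time follows. The sketch assumes that "repeated 40-times" means 40 passes through steps 3 to 5 in total and ignores ramp times between temperatures; both are assumptions, and the names are illustrative.

```python
# (duration in seconds, temperature in deg C) for each step of reactions a and b.
PROGRAM_AB = {
    "initial_steps": [(120, 45), (30, 72)],         # steps 1 and 2
    "cycle_steps": [(30, 94), (30, 52), (60, 72)],  # steps 3 to 5
    "cycles": 40,  # assumed: 40 passes in total
}


def total_hold_seconds(program):
    """Sum of all hold times, without ramping between temperatures."""
    initial = sum(seconds for seconds, _ in program["initial_steps"])
    per_cycle = sum(seconds for seconds, _ in program["cycle_steps"])
    return initial + program["cycles"] * per_cycle


if __name__ == "__main__":
    seconds = total_hold_seconds(PROGRAM_AB)
    print(seconds, "s =", seconds / 60, "min")
```

Under these assumptions the program holds for 4950 s, i.e. about 82.5 minutes.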

The PCR products (670 and 905 bp) were purified by agarose
gel electrophoresis (0.9% agarose), followed by gel extraction
(QIAEX II Gel Extraction Kit, Qiagen, Hilden, Germany). The
purified DNA fragments were used for the PCR reaction c.

PCR reaction c:  6 µl      PCR product of reaction a (approx. 50 ng)
                 6 µl      PCR product of reaction b (approx. 50 ng)
                 2 µl      primer CP-a (10 pmol/ml)
                 2 µl      primer CP-e (10 pmol/ml)
                 10.0 µl   PCR buffer
                 0.75 µl   polymerase mixture (2.6 U)
                 73.25 µl  H2O

Reaction conditions for PCR reaction c:

step 1  2 min - 94°C

step 2  30 sec - 94°C

step 3  30 sec - 55°C

step 4  1 min - 72°C

Steps 2 to 4 were repeated 31-times.



The resulting PCR product (1.4 kb) was purified as
mentioned above, digested with EcoRI, and ligated into an
EcoRI-digested and dephosphorylated pBsk(-) vector (Stratagene, La
Jolla, CA, USA). 1 µl of the ligation mixture was used to
transform E. coli XL-1 competent cells (Stratagene, La Jolla,
CA, USA). All standard procedures were carried out as described
by Sambrook et al. (1987). The DNA sequence of the constructed
gene (fcp10) was checked by sequencing as known in the art.

Example 3

Increasing the thermostability of consensus phytase-1 by
introduction of single mutations suggested by the amino
acid sequences of consensus phytase-10 and consensus
phytase-11



In order to increase the thermostability of homologous
genes, it is also possible to test the stability effect of each
amino acid residue that differs between the protein of interest
and the calculated consensus sequence, and to combine all
stabilizing mutations in the protein of interest. We used
consensus phytase-1 as the protein of interest and tested, by
single site-directed mutagenesis, the effect on protein
stability of 34 amino acid residues that differ relative to
consensus phytase-10 and/or -11.

To construct muteins for expression in A. niger, S.
cerevisiae, or H. polymorpha, the corresponding expression
plasmid containing the consensus phytase-1 gene was used as
template for site-directed mutagenesis (see Examples 6-8).
Mutations were introduced using the "quick exchange™
site-directed mutagenesis kit" from Stratagene (La Jolla, CA,
USA) following the manufacturer's protocol and using the
corresponding primers. All mutations made and the corresponding
primers are summarized in Table 4. Plasmids harboring the
desired mutation were identified by DNA sequence analysis as
known in the art.

Table 4



Primers used for site-directed mutagenesis of consensus phytase-1

Exchanged bases are highlighted in bold. The introduction of a
restriction site is marked above the sequence. When a
restriction site is written in parentheses, the mentioned site
was destroyed by introduction of the mutation.

mutation  Primer set

          Kpn I
Q50T      5'-CACTTGTGGGGTACCTACTCTCCATACTTCTC-3' (SEQ ID NO: 41)
          5'-GAGAAGTATGGAGAGTAGGTACCCCACAAGTG-3'

Y54F      5'-GGTCAATACTCTCCATTCTTCTCTTTGGAAG-3' (SEQ ID NO: 42)
          5'-CTTCCAAAGAGAAGAATGGAGAGTATTGACC-3'

E58A      5'-CATACTTCTCTTTGGCAGACGAATCTGC-3' (SEQ ID NO: 43)
          5'-GCAGATTCGTCTGCCAAAGAGAAGTATG-3'

          Aat II
D69K      5'-CTCCAGACGTCCCAAAGGACTGTAGAGTTAC-3' (SEQ ID NO: 44)
          5'-GTAACTCTACAGTCCTTTGGGACGTCTGGAG-3'

          Aat II
D70G      5'-CTCCAGACGTCCCAGACGGCTGTAGAGTTAC-3' (SEQ ID NO: 45)
          5'-GTAACTCTACAGCCGTCTGGGACGTCTGGAG-3'

K91A      5'-GATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTG-3'
          (SEQ ID NO: 46)
          5'-CAGAGTAAGCCTTAGACGCAGAAGAAGTTGGGTATC-3'

          Sca I
A94K      5'-CTTCTAAGTCTAAGAAGTACTCTGCTTTG-3' (SEQ ID NO: 47)
          5'-CAAAGCAGAGTACTTCTTAGACTTAGAAG-3'

A101R     5'-GCTTACTCTGCTTTGATTGAACGGATTCAAAAGAACGCTAC-3'
          (SEQ ID NO: 48)
          5'-GTAGCGTTCTTTTGAATCCGTTCAATCAAAGCAGAGTAAGC-3'

N134Q     5'-CCATTCGGTGAACAGCAAATGGTTAACTC-3' (SEQ ID NO: 49)
          5'-GAGTTAACCATTTGCTGTTCACCGAATGG-3'

          Nru I
          5'-GATACAAGGCTCTCGCGAGAAACATTGTTC-3' (SEQ ID NO: 50)
          5'-GGAACAATGTTTCTCGCGAGAGCCTTGTATC-3'

          Bss HI
          5'-GATTGTTCCATTCGTGCGCGCTTCTGGTTC-3' (SEQ ID NO: 51)
          5'-GAACCAGAAGCGCGCACGAATGGAACAATC-3'

          Apa I
          5'-GGCTGACCCAGGGGCCCAACCACACCAAGC-3' (SEQ ID NO: 53)
          5'-GCTTGGTGTGGTTGGGCCCCTGGGTCAGCC-3'

          Bcl I
          5'-CTCCAGTTATTAACGTGATCATTCCAGAAGG-3' (SEQ ID NO: 52)
          5'-CCTTCTGGAATGATCACGTTAATAACTGGAG-3'

          Nco I
          5'-CACTTTGGACCATGGTCTTTGTACTGCTTTCG-3' (SEQ ID NO: 54)
          5'-CGAAAGCAGTACAAAGACCATGGTCCAAAGTG-3'

          Avr II
          5'-GCTTTCGAAGACTCTACCCTAGGTGACGACGTTG-3'
          (SEQ ID NO: 55)
          5'-CAACGTCGTCACCTAGGGTAGAGTCTTCGAAAGC-3'

          5'-GGTGACGACGCTGAAGCTAACTTCAC-3' (SEQ ID NO: 56)
          5'-GTGAAGTTAGCTTCAGCGTCGTCACC-3'

          Sac II
          5'-CTAACTTCACCGCGGTGTTCGCTCCAG-3' (SEQ ID NO: 57)
          5'-CTGGAGCGAACACCGCGGTGAAGTTAG-3'

          5'-GCTTTGTTCGCTCCACCTATTAGAGCTAGATTGG-3'
          (SEQ ID NO: 58)
          5'-CCAATCTAGCTCTAATAGGTGGAGCGAACAAAGC-3'

          Hpa I
          5'-GCCAGGTGTTAACTTGACTGACGAAG-3' (SEQ ID NO: 59)
          5'-TTCGTCAGTCAAGTTAACACCTGGC-3'

          Aat II
          5'-GACGAAGACGTCGTTAACTTGATGGAC-3' (SEQ ID NO: 60)
          5'-GTCCATCAAGTTAACGACGTCTTCGTC-3'

          Asp I
          5'-GTCCATTCGACACTGTCGCTAGAACTTC-3' (SEQ ID NO: 61)
          5'-GAAGTTCTAGCGACAGTGTCGAATGGAC-3'

E277Q     5'-CTGACGCTACTCAGCTGTCTCCATTC-3' (SEQ ID NO: 62)
          5'-GAATGGAGACAGCTGAGTAGCGTCAG-3'

A283D     5'-GTCTCCATTCTGTGATTTGTTCACTCAC-3' (SEQ ID NO: 63)
          5'-GTGAGTGAACAAATCACAGAATGGAGAC-3'

          Ksp I
H287A     5'-GCTTTGTTCACCGCGGACGAATGGAG-3' (SEQ ID NO: 64)
          5'-CTCCATTCGTCCGCGGTGAACAAAGC-3'

          Bam HI
R291I     5'-CACGACGAATGGATCCAATACGACTAC-3' (SEQ ID NO: 65)
          5'-GTAGTCGTATTGGATCCATTCGTCGTG-3'

          Bsi WI
Q292A     5'-GACGAATGGAGAGCGTACGACTACTTG-3' (SEQ ID NO: 66)
          5'-CAAGTAGTCGTACGCTCTCCATTCGTC-3'

          Hpa I
A320V     5'-GGTGTTGGTTTCGTTAACGAATTGATTGC-3' (SEQ ID NO: 67)
          5'-GCAATCAATTCGTTAACGAAACCAACACC-3'

          (Bgl II)
R329H     5'-GCTAGATTGACTCACTCTCCAGTTCAAG-3' (SEQ ID NO: 68)
          5'-CTTGAACTGGAGAGTGAGTCAATCTAGC-3'

          Eco RV
S364T     5'-CTCACGACAACACTATGATATCTATTTTCTTC-3' (SEQ ID NO: 69)
          5'-GAAGAAAATAGATATCATAGTGTTGTCGTGAG-3'

          Nco I
I366V     5'-CGACAACTCCATGGTTTCTATTTTCTTCGC-3' (SEQ ID NO: 70)
          5'-GCGAAGAAAATAGAAACCATGGAGTTGTCG-3'

          Kpn I
A379K     5'-GTACAACGGTACCAAGCCATTGTCTAC-3' (SEQ ID NO: 71)
          5'-GTAGACAATGGCTTGGTACCGTTGTAC-3'

S396A     5'-CTGACGGTTACGCTGCTTCTTGGAC-3' (SEQ ID NO: 72)
          5'-GTCCAAGAAGCAGCGTAACCGTCAG-3'

G404A     5'-CTGTTCCATTCGCTGCTAGAGCTTAC-3' (SEQ ID NO: 73)
          5'-GTAAGCTCTAGCAGCGAATGGAACAG-3'

Q415E     5'-GATGCAATGTGAAGCTGAAAAGGAACC-3' (SEQ ID NO: 74)
          5'-GGTTCCTTTTCAGCTTCACATTGCATC-3'

          Sal I
A437G     5'-CACGGTTGTGGTGTCGACAAGTTGGG-3' (SEQ ID NO: 75)
          5'-CCCAACTTGTCGACACCACAACCGTG-3'

          Mun I
A463E     5'-GATCTGGTGGCAATTGGGAGGAATGTTTCG-3' (SEQ ID NO: 76)
          5'-CGAAACATTCCTCCCAATTGCCACCAGATC-3'
and, accordingly, for other mutations.

The temperature optimum of the purified phytases,
expressed in Saccharomyces cerevisiae (Example 7), was
determined as outlined in Example 9. Table 5 shows the effect of
each mutation introduced on the stability of consensus
phytase-1.

Table 5

Stability effect of the individual amino acid replacements in
consensus phytase-1

+ or - means a positive or negative effect, respectively, on the
protein stability of up to 1°C; ++ and -- mean a positive or
negative effect, respectively, on the protein stability of
between 1 and 3°C; the numbers 10 or 11 in parentheses indicate
the consensus phytase sequence that suggested the amino acid
replacement.

stabilizing                neutral                destabilizing
mutation     effect        mutation     effect    mutation     effect

E58A (10)    [illegible]   D69A         ±         Y54F [illegible]
D69K (11)    +             D70G (10)    ±         V73I
D197N (10)   +             N134Q (10)   ±         A94K (10)
T214L (10)   ++            G186H        ±         A101R (11)   -
E222T (11)   ++            S187A (10)   ±         K151M (11)   -
E267D (10)   +             T214V        ±         I158V (10)   --
R291I        +             T251N (10)   ±         A217V
R329H (10)   +             Y259N (10)   ±         V227A (11)   --
S364T (10)   ++            A283D (10)   ±         L234V (10)
A379K (11)   +             A320V (10)   ±         A238P (10)   --
G404A (10)   ++            K445T        ±         E277Q (10)
                           A463E (10)   ±         H287A (11)
                                                  Q292A
                                                  I366V (10)
                                                  S396A (10)   --
                                                  Q415E (11)
                                                  A437G (10)   --
                                                  E451R


We combined eight positive mutations (E58A, D197N, E267D,
R291I, R329H, S364T, A379K, G404A) in consensus
phytase-1-thermo[8], using the primers and the technique
mentioned above in this example. Furthermore, the mutations Q50T
and/or K91A were introduced, which mainly influence the
catalytic characteristics of phytase (see patent applications
EP 897010 and EP 897985, as well as Example 9). The DNA and
amino acid sequences of the resulting phytase (consensus
phytase-1-thermo[8]-Q50T-K91A) are shown in Figure 7. In this
way, the temperature optimum and the melting point of the
consensus phytase were increased by 7°C (Figures 15, 16, 17).

In a further consensus protein, we combined eleven
positive mutations (E58A, D69K, D197N, T214L, E222T, E267D,
R291I, R329H, S364T, A379K, G404A) in consensus
phytase-1-thermo[11]. Furthermore, the mutations Q50T and/or
K91A were introduced. In this way, the melting temperature was
increased by another 3-4°C when compared to consensus
phytase-1-thermo[8].
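
The mutation labels used in this example follow the pattern "original residue, position, replacement residue" (e.g. E58A). A minimal sketch of how such labels could be parsed and how the thermo[8] and thermo[11] sets compare is given below; the function and variable names are illustrative assumptions, while the two mutation lists are those stated in the text.

```python
import re
from typing import NamedTuple


class Mutation(NamedTuple):
    original: str      # one-letter code of the residue being replaced
    position: int      # residue number in consensus phytase-1
    replacement: str   # one-letter code of the new residue


_LABEL = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label):
    """Parse a label such as 'E58A' into a Mutation tuple."""
    match = _LABEL.match(label.strip())
    if match is None:
        raise ValueError(f"not a point-mutation label: {label!r}")
    original, position, replacement = match.groups()
    return Mutation(original, int(position), replacement)


THERMO_8 = ["E58A", "D197N", "E267D", "R291I",
            "R329H", "S364T", "A379K", "G404A"]
THERMO_11 = ["E58A", "D69K", "D197N", "T214L", "E222T", "E267D",
             "R291I", "R329H", "S364T", "A379K", "G404A"]


def added_mutations(base, extended):
    """Labels present in `extended` but not in `base`, sorted by position."""
    base_set = {parse_mutation(label) for label in base}
    extra = [parse_mutation(label) for label in extended]
    extra = [m for m in extra if m not in base_set]
    extra.sort(key=lambda m: m.position)
    return [f"{m.original}{m.position}{m.replacement}" for m in extra]


print(added_mutations(THERMO_8, THERMO_11))
```

The comparison shows that thermo[11] contains all eight thermo[8] mutations plus D69K, T214L, and E222T.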

Using the results of Table 5, we further improved the
thermostability of consensus phytase-10 by the back mutations
K94A, V158I, and A396S, the reverse of which (A94K, I158V, and
S396A) revealed a strong negative influence on the stability of
consensus phytase-1. The resulting protein was called consensus
phytase-10-thermo[3] (SEQ ID NO: 26 plus the three mutations
K94A, V158I, and A396S). Furthermore, we introduced the
mutations Q50T and K91A, which mainly influence the catalytic
characteristics of consensus phytase (see patent applications
EP 897010 and EP 897985, as well as Example 9 and Figures 14 and
15). The resulting DNA and amino acid sequences are shown in
Figure 8. The optimized phytase showed a 4°C higher temperature
optimum and melting point than consensus phytase-10 (Figures 12
and 13). Furthermore, the phytase also has a strongly increased
specific activity with phytate as substrate of 250 U/mg at pH
5.5 (Figure 14).

In a still further consensus protein, two additional
mutations (E222T, G437A) were introduced into consensus
phytase-10-thermo[3], which yielded consensus
phytase-10-thermo[5]. Furthermore, the mutations Q50T and/or
K91A were introduced. In this way, the melting temperature was
increased by another 1-2°C when compared to consensus
phytase-10-thermo[3].

Example 4

Stabilization of the phytase of A. fumigatus ATCC 13073 by
replacement of amino acid residues with the corresponding
consensus phytase-1 and/or consensus phytase-10 residues

At six amino acid sequence positions where A. fumigatus
13073 phytase is the only or nearly the only phytase in the
alignment of Figure 1 that does not contain the corresponding
consensus phytase amino acid residue, the non-consensus amino
acid residue was replaced by the consensus one. The following
amino acids were substituted in A. fumigatus 13073 phytase,
which additionally contained the Q51(24)T substitution
(influencing the catalytic properties and corresponding to the
Q50T substitution in the consensus phytases) and the signal
sequence of A. terreus cbs116.46 phytase (see European Patent
Application No. 0897010, and Figure 9): F55(28)Y, V100(73)I,
F114(87)Y, A243(220)L, S265(242)P, N294(282)D. The numbers in
parentheses refer to the numbering in Figure 1.

In a second round, four of the seven stabilizing amino
acid exchanges (E58A, R329H, S364T, G404A) identified in
consensus phytase-10 and tested as single mutations in consensus
phytase-1 (Table 5) were additionally introduced into the A.
fumigatus alpha-mutant. Furthermore, the amino acid replacement
S154N, shown to reduce the protease susceptibility of the
phytase, was introduced.

The mutations were introduced as described in Example 3
(see Table 6) and expressed as described in Examples 6 to 8. The
resulting A. fumigatus 13073 phytase variants were called
alpha-mutant (i.e. the A. fumigatus ATCC 13073 phytase with the
substitutions Q24T, F28Y, V73I, F87Y, A220L, S242P, N282D) and
"optimized" alpha-mutant (i.e. the A. fumigatus alpha-mutant
having the additional substitutions
E59A-S154N-R329H-S364T-G404A). K92A is an additional preferred
mutation.

The temperature optimum (60°C, Figure 20) and the melting
temperature (67.0°C, Figure 19) of the A. fumigatus 13073
alpha-mutant phytase were increased by 5-7°C in comparison to
the values of the wild-type phytase (temperature optimum: 55°C /
Tm: 60°C). The five additional amino acid replacements further
increased the temperature optimum by 3°C (Figure 20).

Table 6

Mutagenesis primers for the stabilization of A. fumigatus ATCC
13073 phytase

Mutation  Primer

F55Y      5'-CACGTACTCGCCATACTTTTCGCTCGAG-3' (SEQ ID NO: 77)
          5'-CTCGAGCGAAAAGTATGGCGAGTACGTG-3'

          (Xho I)
E58A      5'-CCATACTTTTCGCTCGCGGACGAGCTGTCCGTG-3'
          (SEQ ID NO: 78)
          5'-CACGGACAGCTCGTCOGrCGAGCGAAAAGTAGG-3'

V100I     5'-GTATAAGAAGCTTATTACGGCGATCCAGGCC-3'
          (SEQ ID NO: 79)
          5'-GGCCTGGATCGCCGTAATAAGCTTCTTATAC-3'

F114Y     5'-CTTCAAGGGCAAGTACGCCTTTTTGAAGACG-3'
          (SEQ ID NO: 80)
          5'-CGTCTTCAAAAAGGCGTACTTGCCCTTGAAG-3'

A243L     5'-CATCCGAGCTCGCCTCGAGAAGCATCTTC-3' (SEQ ID NO: 81)
          5'-GAAGATGCTTCTCGAGGCGAGCTCGGATG-3'

S265P     5'-CTAATGGACATGTGTCCGTTTGATACGGTAG-3' (SEQ ID NO: 82)
          5'-CTACCGTATCAAACGGACACATGTCCATTAG-3'

N294D     5'-GTGGAAGAAGTACGACTACCTTCAGTC-3' (SEQ ID NO: 83)
          5'-GACTGAAGGTAGTCGTACTTCTTCCAC-3'

          (Mlu I)
R329H     5'-GCCCGGTTGACGCATTCGCCAGTGCAGG-3' (SEQ ID NO: 84)
          5'-CCTGCACTGGCGAATGCGTCAACCGGGC-3'

          Nco I
S364T     5'-CACACGACAACACCATGGTTTCCATCTTC-3' (SEQ ID NO: 85)
          5'-GAAGATGGAAACCATGGTGTTGTCGTGTG-3'

          (Bss HI)
G404A     5'-GTGGTGCCTTTCGCCGCGCGAGCCTACTTC-3' (SEQ ID NO: 86)
          5'-GAAGTAGGCTCGCGCGGCGAAAGGCACCAC-3'
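
Each mutagenesis primer set above consists of a forward primer and, beneath it, the complementary strand written 5' to 3'. A minimal sketch of how that relationship could be checked is shown below, using the N294D pair of Table 6 as input; the function names are illustrative assumptions and not part of any kit or protocol.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence given 5' to 3'."""
    sequence = sequence.upper()
    unexpected = set(sequence) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return sequence.translate(_COMPLEMENT)[::-1]


def is_complementary_pair(forward, reverse):
    """True if `reverse` is exactly the reverse complement of `forward`."""
    return reverse_complement(forward) == reverse.upper()


# N294D primer pair as printed in Table 6 (SEQ ID NO: 83 and its partner).
N294D_FORWARD = "GTGGAAGAAGTACGACTACCTTCAGTC"
N294D_REVERSE = "GACTGAAGGTAGTCGTACTTCTTCCAC"

print(is_complementary_pair(N294D_FORWARD, N294D_REVERSE))
```

A check of this kind is also a convenient way to spot transcription errors in a printed primer list, since a single mistyped base breaks the complementarity.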

Example 5

Introduction of the active site amino acid residues of A.
niger NRRL 3135 phytase into consensus phytase-1

We used the crystal structure of Aspergillus niger
NRRL 3135 phytase to define all active site amino acid residues
(see Example 1, and EP 897010). Using the alignment of Figure 1,
we replaced the following active site residues, and additionally
the non-identical adjacent ones, of consensus phytase-1 by those
of A. niger phytase:

S89D, S92G, A94K, D164S, P201S, G203A, G205S, H212P,
G224A, D226T, E255T, D256E, V258T, P265S, Q292H, G300K, Y305H,
A314T, S364G, M365I, A397S, S398A, G404A, and A405S.

The new consensus phytase-7 protein sequence was
backtranslated into a DNA sequence (Figure 10) as described in
Example 1. The corresponding gene (fcp7) was generated as
described in Example 1 using the following oligonucleotide
mixes:

Mix 1.7: CP-1, CP-2, CP-3, CP-4.7, CP-5.7, CP-6, CP-7,
         CP-8.7, CP-9, CP-10.7

Mix 2.7: CP-9, CP-10.7, CP-11.7, CP-12.7, CP-13.7, CP-14.7,
         CP-15.7, CP-16, CP-17.7, CP-18.7, CP-19.7, CP-20,
         CP-21, CP-22.

The DNA sequences of the oligonucleotides are indicated in
Figure 10. The newly synthesized oligonucleotides are
additionally marked by the number 7. After assembly of the
oligonucleotides using the same PCR primers as mentioned in
Example 1, the gene was cloned into an expression vector as
described in Examples 6-8.

The pH-profile of the enzyme determined after expression 
in H. polymorpha and purification was very similar to that of A. 
niger phytase (see Figure 18) . 

Example 6 

Expression of the consensus phytase genes in Hansenula 
polymorpha 

The phytase expression vectors used to transform H. polymorpha
RB11 [Gellissen, G., Hollenberg, C. P., Janowicz, Z. A. (1994)
Gene expression in methylotrophic yeasts, in Smith, A. (ed.)
Gene expression in recombinant microorganisms. Dekker, New York,
pp. 395-439] were constructed by inserting the Eco RI fragment
of pBsk-fcp or variants thereof into the multiple cloning site
of the H. polymorpha expression vector pFPMT121, which is based
on an ura3 selection marker from S. cerevisiae, a formate
dehydrogenase (FMD) promoter element and a methanol oxidase
(MOX) terminator element from H. polymorpha. The 5' end of the
fcp gene is fused to the FMD promoter, the 3' end to the MOX
terminator (Gellissen et al., Appl. Microbiol. Biotechnol. 46,
46-54, 1996; EP 299108). The resulting expression vectors are
designated pFPMTfcp, pFPMTfcp10, and pFPMTfcp7.

The constructed plasmids were propagated in E. coli.
Plasmid DNA was purified using standard state of the art
procedures. The expression plasmids were transformed into the H.
polymorpha strain RB11, deficient in orotidine-5'-phosphate
decarboxylase (ura3), using the procedure for preparation of
competent cells and for transformation of yeast as described in
Gellissen et al. (1996). Each transformation mixture was plated
on YNB medium (0.14% w/v Difco YNB and 0.5% ammonium sulfate)
containing 2% glucose and 1.8% agar, and incubated at 37°C.
After 4 to 5 days, individual transformant colonies were picked
and grown in the liquid medium described above for 2 days at
37°C. Subsequently, an aliquot of this culture was used to
inoculate fresh vials with YNB medium containing 2% glucose.
After seven further passages in selective medium, the expression
vector had integrated into the yeast genome in multimeric form.
Subsequently, mitotically stable transformants were obtained by
two additional cultivation steps in 3 ml non-selective liquid
medium (YPD, 2% glucose, 10 g/l yeast extract, and 20 g/l
peptone). In order to obtain genetically homogeneous recombinant
strains, an aliquot from the last stabilization culture was
plated on a selective plate. Single colonies were isolated for
analysis of phytase expression in YNB containing 2% glycerol
instead of glucose to derepress the FMD promoter. Purification
of the consensus phytases was done as described in Example 7.

Example 7 

Expression of the consensus phytase genes in Saccharomyces 
cerevisiae and purification of the phytases from the 
culture supernatant 

The consensus phytase genes were isolated from the
corresponding Bluescript plasmid (pBsk-fcp, pBsk-fcp10,
pBsk-fcp7) and ligated into the Eco RI sites of the expression
cassette of the Saccharomyces cerevisiae expression vector pYES2
(Invitrogen, San Diego, CA, USA) or subcloned between the
shortened GAPFL (glyceraldehyde-3-phosphate dehydrogenase)
promoter and the pho5 terminator as described by Janes et al.,
Curr. Genet. 18, 97-103. The correct orientation of the gene was
checked by PCR. Transformation of S. cerevisiae strains, e.g.
INVSc1 (Invitrogen, San Diego, CA, USA), was done according to
Hinnen et al., Proc. Natl. Acad. Sci. USA 75, 1929-1933 (1978).
Single colonies harboring the phytase gene under the control of
the GAPFL promoter were picked and cultivated in 5 ml selection
medium [SD-uracil; Sherman, J. P., Finck, G. R. & Hicks, J. B.
(1986) Laboratory course manual for methods in yeast genetics.
Cold Spring Harbor University] at 30°C under vigorous shaking
(250 rpm) for one day. The preculture was then added to 500 ml
YPD medium (Sherman et al., 1986) and grown under the same
conditions. Induction of the gal1 promoter was done according to
the manufacturer's instructions. After four days of incubation,
the cell broth was centrifuged (7000 rpm, GS3 rotor, 15 min,
5°C) to remove the cells, and the supernatant was concentrated
by ultrafiltration in Amicon 8400 cells (PM30 membranes;
Grace AG, Wallisellen, Switzerland) and ultrafree-15 centrifugal
filter devices (Biomax-30K, Millipore, Bedford, MA, USA). The
concentrate (10 ml) was desalted on a 40 ml Sephadex G25
Superfine column (Pharmacia Biotech, Freiburg, Germany), with 10
mM sodium acetate, pH 5.0, serving as elution buffer. The
desalted sample was brought to 2 M (NH4)2SO4 and directly loaded
onto a 1 ml Butyl Sepharose 4 Fast Flow hydrophobic interaction
chromatography column (Pharmacia Biotech, Freiburg, Germany),
which was eluted with a linear gradient from 2 M to 0 M
(NH4)2SO4 in 10 mM sodium acetate, pH 5.0. Phytase was eluted in
the breakthrough, concentrated, and loaded on a 120 ml Sephacryl
S-300 gel permeation chromatography column (Pharmacia Biotech,
Freiburg, Germany). Consensus phytases -1, -7 and -10 eluted as
a homogeneous symmetrical peak and were shown by SDS-PAGE to be
approx. 95% pure.



Example 8 

Expression of the consensus phytase genes in Aspergillus 
niger 

The Bluescript plasmids pBsk-fcp, pBsk-fcp10, and
pBsk-fcp7 were used as template for the introduction of a Bsp HI
site upstream of the start codon of the genes and an Eco RV site
downstream of the stop codon. The Expand™ High Fidelity PCR Kit
(Boehringer Mannheim, Mannheim, Germany) was used with the
following primers:

Primer Asp-1:

   Bsp HI
5'-TATATCATGAGCGTGTTCGTCGTGCTACTGTTC-3' (SEQ ID NO: 87)

Primer Asp-2 used for cloning of fcp and fcp7:

                         Eco RV
3'-ACCCGACTTACAAAGCGAATTCTATAGATATAT-5' (SEQ ID NO: 88)

Primer Asp-3 used for cloning of fcp10:

                         Eco RV
3'-ACCCTTCTTACAAAGCGAATTCTATAGATATAT-5' (SEQ ID NO: 89)

The reaction was performed as described by the supplier.
The PCR-amplified fcp genes had a new Bsp HI site at the start
codon, introduced by primer Asp-1, which resulted in a
replacement of the second amino acid residue glycine by serine.
Subsequently, the DNA fragment was digested with Bsp HI and Eco
RV and ligated into the Nco I site downstream of the
glucoamylase promoter of Aspergillus niger (glaA) and the Eco RV
site upstream of the Aspergillus nidulans tryptophan C
terminator (trpC) (Mullaney et al., 1985). After this cloning
step, the genes were sequenced to detect possible errors
introduced by PCR. The resulting expression plasmids, which
basically correspond to the pGLAC vector as described in Example
9 of EP 684313, contained the orotidine-5'-phosphate
decarboxylase gene (pyr4) of Neurospora crassa as a selection
marker. Transformation of Aspergillus niger and expression of
the consensus phytase genes were done as described in EP 684313.
The consensus phytases were purified as described in Example 7.

Example 9 

Determination of phytase activity and of the pH and 
temperature optima 

This example relates inter alia to the determination of phytase
activity and of the temperature optimum. Various phytases have
been tested.

The phytase of Aspergillus niger NRRL 3135 was prepared as
described in EP 420358 and by van Hartingsveldt et al. (Gene
127, 87-94, 1993).

The phytases of Aspergillus fumigatus ATCC 13073,
Aspergillus terreus 9A-1, Aspergillus terreus cbs116.46,
Emericella nidulans, Myceliophthora thermophila, and
Talaromyces thermophilus were prepared as described in
EP-0897985 and in the references therein.

The remaining phytases tested were prepared as described
herein.

Consensus phytase-1-thermo[8] designates a variant of
consensus phytase-1, which further comprises the eight mutations
which are underlined in the legend to Figure 5. Consensus
phytase-1 is shown in Fig. 1 (SEQ ID NO: 14) without signal
peptide, and in Fig. 2 (SEQ ID NO: 16) with the signal peptide.

Phytase activity was determined basically as described by
Mitchell et al. (1997). The activity was measured in an assay
mixture containing 0.5% phytic acid (≈5 mM) in 200 mM sodium
acetate, pH 5.0. After 15 min of incubation at 37°C, the
reaction was stopped by addition of an equal volume of 15%
trichloroacetic acid. The liberated inorganic phosphate was
quantified by mixing 100 µl of the assay mixture with 900 µl H2O
and 1 ml of 0.6 M H2SO4, 2% ascorbic acid and 0.5% ammonium
molybdate. Standard solutions of potassium phosphate were used
as reference. One unit of enzyme activity was defined as the
amount of enzyme that releases 1 µmol phosphate per minute at
37°C. The protein concentration was determined using the enzyme
extinction coefficient at 280 nm calculated according to Pace et
al. [Pace, N. C., Vajdos, F., Fee, L., Grimsley, G. & Gray, T.
(1995) How to measure and predict the molar absorption
coefficient of a protein. Prot. Sci. 4, 2411-2423]: 1 absorption
unit (1 OD) at 280 nm corresponds to 1.101 mg/ml of consensus
phytase-1, 1.068 mg/ml of consensus phytase-7, and 1.039 mg/ml
of consensus phytase-10.
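
The unit definition and the absorbance conversion factors above can be combined into a small calculation. The following is a sketch only: the conversion factors and the definition of one unit are taken from the text, whereas the function names and the numerical inputs in the usage lines are hypothetical and chosen purely for illustration.

```python
# Protein concentration (mg/ml) corresponding to 1 absorption unit at 280 nm,
# as quoted in the text for the three consensus phytases.
MG_PER_ML_PER_A280 = {
    "consensus phytase-1": 1.101,
    "consensus phytase-7": 1.068,
    "consensus phytase-10": 1.039,
}


def protein_mg_per_ml(a280, phytase):
    """Convert an absorbance reading at 280 nm into mg/ml of protein."""
    return a280 * MG_PER_ML_PER_A280[phytase]


def activity_units(phosphate_umol, minutes=15.0):
    """Units of activity: µmol phosphate released per minute of incubation."""
    if minutes <= 0:
        raise ValueError("incubation time must be positive")
    return phosphate_umol / minutes


def specific_activity(phosphate_umol, minutes, protein_mg):
    """Specific activity in U per mg of protein present in the assay."""
    if protein_mg <= 0:
        raise ValueError("protein amount must be positive")
    return activity_units(phosphate_umol, minutes) / protein_mg


# Hypothetical inputs, for illustration only (not measured values).
print(protein_mg_per_ml(2.0, "consensus phytase-10"))
print(specific_activity(30.0, 15.0, 0.5))
```

The default incubation time of 15 min mirrors the assay described above; any dilution of the enzyme before the assay would have to be accounted for separately.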

In case of pH-optimum curves, the purified enzymes were
diluted in 10 mM sodium acetate, pH 5.0. Incubations were
started by mixing aliquots of the diluted protein with an equal
volume of 1% phytic acid (≈10 mM) in a series of different
buffers: 0.4 M glycine/HCl, pH 2.5; 0.4 M acetate/NaOH, pH 3.0,
3.5, 4.0, 4.5, 5.0, 5.5; 0.4 M imidazole/HCl, pH 6.0, 6.5; 0.4 M
Tris/HCl, pH 7.0, 7.5, 8.0, 8.5, 9.0. Control experiments showed
that pH was only slightly affected by the mixing step.
Incubations were performed for 15 min at 37°C as described
above.

For determination of the substrate specificities of the
phytases, phytic acid in the assay mixture was replaced by 5 mM
concentrations of the respective phosphate compounds. Otherwise,
the activity tests were performed as described above.

For determination of the temperature optimum, enzyme (100
µl) and substrate solution (100 µl) were pre-incubated for 5 min
at the given temperature. The reaction was started by addition
of the substrate solution to the enzyme. After 15 min of
incubation, the reaction was stopped with trichloroacetic acid,
and the amount of phosphate released was determined.

The pH-optimum of consensus phytase-1 was around pH
6.0-6.5 (70 U/mg). Introduction of the Q50T mutation shifted the
pH-optimum to pH 6.0 (130 U/mg). Introduction of the K91A
mutation further shifted the pH optimum into the more acidic pH
range. Comparable effects of the Q50T and K91A mutations were
also observed for consensus phytase-10 and for further
stabilized consensus phytase variants (Figures 14 and 15).

Consensus phytase-7, which was constructed to transfer the
catalytic characteristics of A. niger NRRL 3135 phytase to
consensus phytase-1, had a pH-profile very similar to that of A.
niger NRRL 3135 phytase (see Figure 18). The substrate
specificity also resembled that of A. niger NRRL 3135 phytase
more than that of consensus phytase-1.

The temperature optimum of consensus phytase-1 (71°C) was
16-26°C higher than the temperature optima of the wild-type
phytases (45-55°C, Table 7) that were used to calculate the
consensus sequence. The improved consensus phytase-10 showed a
further increase of its temperature optimum to 80°C (Figure 13).
The temperature optimum of consensus phytase-1-thermo[8]
was found to be in the same range (78°C) when using the
supernatant of an overproducing S. cerevisiae strain. The
highest temperature optimum reached, 82°C, was determined for
consensus phytase-10-thermo[3]-Q50T-K91A.

Table 7

Temperature optima and Tm-values of consensus phytase and
of the phytases from A. fumigatus, A. niger, E. nidulans,
and M. thermophila.

The determination of the temperature optimum was performed
as described in Example 9. The Tm-values were determined by
differential scanning calorimetry as described in Example 10.



Phytase                                        Optimum            Tm (°C)
                                               temperature (°C)

Aspergillus niger NRRL 3135                    55                 63.3
Aspergillus fumigatus ATCC 13073               55                 62.5
Aspergillus terreus 9A-1                       49                 57.5
Aspergillus terreus cbs116.46                  45                 58.5
Emericella nidulans                            45                 55.7
Myceliophthora thermophila                     55
Talaromyces thermophilus                       45
Consensus phytase-10-thermo[5]-Q50T-K91A                          90.4
Consensus phytase-10-thermo[3]-Q50T-K91A       82                 89.3
Consensus phytase-10-thermo[3]-Q50T            82                 88.6
Consensus phytase-10                           80                 85.4
Consensus phytase-1-thermo[11]-Q50T-K91A                          88.0
Consensus phytase-1-thermo[11]-Q50T                               88.5
Consensus phytase-1-thermo[8]-Q50T-K91A                           85.7
Consensus phytase-1-thermo[8]-Q50T             78                 84.7
Consensus phytase-1-thermo[8]                  81
Consensus phytase-1-thermo[3]                  75
Consensus phytase-1-Q50T                                          78.9
Consensus phytase-1                            71                 78.1
Aspergillus fumigatus alpha-mutant Q51T        60                 67.0
Aspergillus fumigatus alpha-mutant, plus       63
  mutations E59A, S154N, R329H, S364T,
  G404A
Aspergillus fumigatus "optimized"              63
  alpha-mutant, plus mutation K92A

Example 10

Determination of the melting temperature by differential 
scanning calorimetry (DSC) 

In order to determine the unfolding temperature of the
phytases, differential scanning calorimetry was applied as
described by Brugger et al., 1997 [Brugger, R., Mascarello, F.,
Augem, S., van Loon, A. P. G. M. & Wyss, M. (1997). Thermal
denaturation of phytases and pH 2.5 acid phosphatase studied by
differential scanning calorimetry. In The Biochemistry of
phytate and phytase (eds. Rasmussen, S.K.; Raboy, V.; Dalbøge,
H. and Loewus, F.); Kluwer Academic Publishers, Dordrecht, the
Netherlands]. Solutions of 50-60 mg/ml of homogeneous phytase
were used for the tests. A constant heating rate of 10°C/min was
applied up to 90-95°C.

The determined melting points confirm the results obtained
for the temperature optima (Table 7). The most stable consensus
phytase designed so far is consensus
phytase-10-thermo[3]-Q50T-K91A, showing a melting temperature
under the chosen conditions of 89.3°C. This is 26.0 to 33.6°C
higher than the melting temperature of the wild-type phytases
used.
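
The quoted range of 26.0 to 33.6°C follows directly from the Tm-values in Table 7. A minimal sketch of the arithmetic is given below; the Tm-values are those listed in Table 7 for the wild-type phytases with a reported Tm, and the function and variable names are illustrative.

```python
# Tm-values (°C) of the wild-type phytases as listed in Table 7.
WILD_TYPE_TM = {
    "Aspergillus niger NRRL 3135": 63.3,
    "Aspergillus fumigatus ATCC 13073": 62.5,
    "Aspergillus terreus 9A-1": 57.5,
    "Aspergillus terreus cbs116.46": 58.5,
    "Emericella nidulans": 55.7,
}

# Tm (°C) of consensus phytase-10-thermo[3]-Q50T-K91A from Table 7.
CONSENSUS_TM = 89.3


def tm_gain_range(reference_tm, wild_type_tms):
    """Smallest and largest Tm increase over the wild types, to 0.1°C."""
    gains = [reference_tm - tm for tm in wild_type_tms.values()]
    return round(min(gains), 1), round(max(gains), 1)


print(tm_gain_range(CONSENSUS_TM, WILD_TYPE_TM))
```

The smallest gain is relative to A. niger NRRL 3135 phytase (Tm 63.3°C) and the largest relative to E. nidulans phytase (Tm 55.7°C).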

Example 11 

Transfer of basidiomycete phytase active site into 
consensus phytase- 10 -thermo [3] -Q50T-K91A 

As described previously (Example 5), mutations derived
from the basidiomycete phytase active sites were introduced into
consensus phytase-10. The following five constructs a) to e)
were prepared:

a) the construct called consensus phytase-12, which
comprises a selected number of active site residues of the
basidio consensus sequence. Its amino acid sequence is shown in
Fig. 21 (the first 26 amino acids form the signal peptide;
positions differing from consensus phytase-10-thermo[3]-Q50T-K91A
are underlined);

b) a cluster of mutations (Cluster II) was transferred to
the consensus phytase-1 and -10 sequences, viz.: S80Q, Y86F,
S90G, K91A, S92A, K93T, A94R, Y95I;

c) in a similar way, another cluster of mutations (Cluster
III) was transferred, viz.: T129V, E133A, Q134N, M136S, V137S,
N138Q, S139A;

d) in a similar way, a further cluster of mutations
(Cluster IV) was transferred, viz.: A168D, E171T, K172N, F173W;

e) and finally, a further cluster of mutations (Cluster V)
was transferred, viz.: Q297G, S298D, G300D, Y305T.

These constructs were expressed as described in Examples 6
to 8.



Example 12 

Phytase alignment using GAP 
20 The phytases described herein - i.e. the amino acid 

sequences as well as the corresponding DNA sequences - were 
aligned against each other. Also some other phytases were 
correspondingly aligned, viz. the following: 

- the consensus phytase-1 described in EP 897985;

- the phytase derived from Aspergillus niger (ficuum) NRRL 3135
(A. niger NRRL3135) described in EP 420358;

- the phytases derived from Aspergillus fumigatus ATCC 13073 (A.
fumigatus 13073); Aspergillus fumigatus ATCC 32239 (A.
fumigatus 32239); Aspergillus terreus CBS 116.46 (A. terreus cbs);
Emericella nidulans (E. nidulans); and Talaromyces thermophilus
(T. thermophilus) - all described in EP 897010;

- the phytases derived from Myceliophthora thermophila (M.
thermophila); and Aspergillus terreus 9-A1 (A. terreus 9-A1) -
both described in EP 684313;

- the phytase derived from Thermomyces lanuginosus (T.
lanuginosus) described in WO 97/35017 (PCT/US97/04559);

- the phytases derived from Agrocybe pediades (A. pediades),
Paxillus involutus 1 and 2 (P. involutus phyA1 and phyA2); and
Trametes pubescens (T. pubescens) - all described in WO
98/28409; and

- the phytase derived from Peniophora lycii (P. lycii) described
in WO 98/28408.

For the alignments, the program GAP was used with the 
settings as described above. 

For polypeptide comparisons, the signal peptides were
included, with the exception of comparisons with consensus
phytase-11.
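
Because signal peptides are included in most comparisons, two numbering schemes appear in this document: the same consensus phytase-10 stretch is referred to both as amino acids 1-467 and as amino acids -26 to +441, and the description of Fig. 21 states that the first 26 amino acids form the signal peptide. The following is a minimal sketch of the conversion between the two schemes. It assumes that the signed scheme has no position 0 (consistent with 26 + 441 = 467); the function names are hypothetical.

```python
# Signal peptide length stated in the text (first 26 amino acids).
SIGNAL_PEPTIDE_LENGTH = 26


def to_signed_numbering(full_pos, signal_len=SIGNAL_PEPTIDE_LENGTH):
    """Map a 1-based full-length position to signed numbering.

    Signal peptide residues become -signal_len..-1 and mature residues
    become +1, +2, ...; there is no position 0 (an assumption).
    """
    if full_pos < 1:
        raise ValueError("full-length positions start at 1")
    if full_pos <= signal_len:
        return full_pos - signal_len - 1
    return full_pos - signal_len


def to_full_numbering(signed_pos, signal_len=SIGNAL_PEPTIDE_LENGTH):
    """Inverse of to_signed_numbering."""
    if signed_pos == 0 or signed_pos < -signal_len:
        raise ValueError("position not defined in the signed scheme")
    if signed_pos < 0:
        return signed_pos + signal_len + 1
    return signed_pos + signal_len


print(to_signed_numbering(1), to_signed_numbering(467))
```

With a 26-residue signal peptide, full-length positions 1 and 467 map to -26 and +441, and position 27 is the first residue of the mature protein.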

The results of the amino acid sequence comparisons are 
shown in Table 8 below. The first number in each cell is the 
amino acid similarity, the second number is the amino acid 
identity. 

For DNA sequence comparisons, the signal sequence was 
always included. The results are shown in Table 9 below. 

This invention comprises e.g. the following embodiments 
(A) to (J) that are described below. 

In these embodiments, when determining % identity or %
similarity at the amino acid level for another phytase, its
amino acid sequence is aligned with the reference sequence (e.g.
in embodiment (A) the consensus phytase-10 amino acid sequence),
using an alignment program such as GAP referred to above.
Percentage identity, as well as percentage similarity, is
calculated by the program. The amino acid sequence of the other
phytase may or may not include the signal peptide.
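
To make the two percentages concrete, the sketch below computes them from two sequences that have already been aligned. It is not the GAP program: the denominator (columns where neither sequence has a gap) and the similarity groups are assumptions made for illustration, whereas GAP derives similarity from its scoring matrix. The two 10-residue fragments are taken from the alignment figure of this document (positions 91-100 of A. terreus 9A-1 and of the consensus phytase).

```python
GAP_CHARS = {"-", "."}

# Hypothetical similarity groups, for illustration only.
SIMILAR_GROUPS = [set("ILVM"), set("FYW"), set("KRH"), set("DE"),
                  set("NQ"), set("ST"), set("AG")]


def are_similar(a, b):
    """True if the residues are identical or share a similarity group."""
    return a == b or any(a in g and b in g for g in SIMILAR_GROUPS)


def percent_identity_similarity(aligned_a, aligned_b):
    """Return (% identity, % similarity) over gap-free alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a in GAP_CHARS or b in GAP_CHARS:
            continue  # columns with a gap are not counted
        compared += 1
        if a == b:
            identical += 1
        if are_similar(a, b):
            similar += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


identity, similarity = percent_identity_similarity("QSYNYSLDSE", "KTYNYTLGAD")
print(identity, similarity)
```

Similarity is never lower than identity under this definition, since identical residues also count as similar. This matches the pattern in the embodiments, where each similarity threshold is slightly higher than the corresponding identity threshold.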

When determining % identity on the DNA level for another
phytase-encoding DNA, this DNA sequence is aligned with the
reference sequence [e.g. in embodiment (A) nucleotides 12-1412
of SEQ ID NO: 25 (the DNA sequence of consensus phytase-10
(Fcp10) as shown in Fig. 5)], using an alignment program such as
GAP referred to above. Percentage identity is calculated by the
program. The DNA sequence encoding the other phytase can be a
genomic DNA sequence including introns, or it can be a cDNA
sequence. It may or may not include the signal peptide-encoding
part.
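
The nucleotide ranges quoted in this document are 1-based and inclusive. As a minimal sketch (helper names are hypothetical), the following shows how such a range maps onto a string slice and why nucleotides 12-1412 correspond to amino acids 1-467: the range spans 1401 nucleotides, i.e. 467 codons.

```python
def range_length(start, end):
    """Number of nucleotides in a 1-based inclusive range."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def codon_count(start, end):
    """Number of whole codons in the range; rejects partial codons."""
    length = range_length(start, end)
    if length % 3 != 0:
        raise ValueError("range length is not a multiple of 3")
    return length // 3


def subsequence(seq, start, end):
    """Extract a 1-based inclusive range from a sequence string."""
    if end > len(seq):
        raise ValueError("range extends past the end of the sequence")
    range_length(start, end)  # validates start and end
    return seq[start - 1:end]


print(range_length(12, 1412), codon_count(12, 1412))
print(subsequence("ACGTACGT", 2, 4))
```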

When determining hybridization, the probe to be used is
the specified DNA sequence [e.g. in embodiment (A) nucleotides
12-1412 of SEQ ID NO: 25 (the DNA sequence of consensus
phytase-10 (Fcp10) as shown in Fig. 5)]. The DNA sequence
encoding the other phytase can be a genomic DNA sample which
contains a phytase-encoding DNA sequence; a purified genomic DNA
sequence (purified with respect to the phytase-encoding DNA
sequence); or it can be a phytase-encoding cDNA sequence,
preferably purified or amplified, e.g. PCR-amplified. The
phytase-encoding DNA, whatever type, may or may not include the
signal peptide-encoding part. Suitable hybridization conditions
are referred to above.

The term "DNA sequence" includes such fragments or parts
of the herein exemplified DNA sequences, as long as they are
capable of encoding an active enzyme (e.g. phytase).

The term "amino acid sequence" includes such fragments
or parts of the herein exemplified amino acid sequences, as long
as they are enzymatically active (e.g. displaying phytase
activity).

(A) Phytases and corresponding DNA sequences related to
consensus phytase-10 (CP10, Fcp10)

A phytase that comprises an amino acid sequence which is
at least 93.80%; or at least 94, 94.5, 95, 95.5, 96, 96.5, 97,
97.5, 98, 98.5, 99, 99.5% identical to the sequence of amino
acids 1-467 of consensus phytase-10 (Fcp10) as shown in Fig. 5.
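
A percentage threshold such as "at least 93.80% identical to amino acids 1-467" can be translated into a residue count. The sketch below does this with exact arithmetic. It assumes, purely for illustration, that the percentage is taken over all positions of the reference sequence with no gaps; an alignment program such as GAP may use a different denominator, so the counts are indicative only. The function names are hypothetical.

```python
import math
from fractions import Fraction


def min_identical_residues(threshold_percent, reference_length):
    """Smallest number of identical positions that meets the threshold.

    threshold_percent is passed as a string (e.g. "93.80") so that the
    calculation is exact and free of floating-point rounding.
    """
    needed = Fraction(threshold_percent) * reference_length / 100
    return math.ceil(needed)


def max_differences(threshold_percent, reference_length):
    """Largest number of differing positions that still meets the threshold."""
    return reference_length - min_identical_residues(
        threshold_percent, reference_length)


print(min_identical_residues("93.80", 467), max_differences("93.80", 467))
```

Under that assumption, 93.80% of 467 positions is 438.046, so at least 439 positions must be identical and at most 28 may differ.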

A phytase that comprises an amino acid sequence which is
at least 95.09%; or at least 95.5, 96, 96.5, 97, 97.5, 98, 98.5,
99, 99.5% similar to the sequence of amino acids 1-467 of
consensus phytase-10.

A phytase which is encoded by a DNA sequence which is at
least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99,
99.5% identical to nucleotides 12-1412 of the DNA sequence of
consensus phytase-10 (Fcp10) as shown in Fig. 5.

A DNA sequence which encodes a phytase and which (i) is at
least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99,
99.5% identical; or (ii) hybridizes under low, or medium,
medium/high, high, or very high stringency conditions to
nucleotides 12-1412 of the DNA sequence of consensus phytase-10
(Fcp10) as shown in Fig. 5. A suitable negative control is DNA
encoding consensus phytase-1. A suitable positive control is DNA
encoding any of CP10, CP10-thermo[3]-Q50T-K91A, CP1-thermo[8],
or CP1-thermo[8]-Q50T-K91A.

A DNA sequence which encodes a phytase comprising an amino
acid sequence which is at least 93.80%; or at least 94, 94.5,
95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical to
the sequence of amino acids 1-467 of consensus phytase-10
(Fcp10) as shown in Fig. 5.

(B) Phytases and corresponding DNA sequences related to
consensus phytase-10-thermo[3]-Q50T-K91A

A phytase which comprises an amino acid sequence which is
at least 93.37%; or at least 93.5, 94, 94.5, 95, 95.5, 96, 96.5,
97, 97.5, 98, 98.5, 99, 99.5% identical to the sequence of amino
acids 1-467 of consensus phytase-10-thermo[3]-Q50T-K91A as shown
in Fig. 8.

A phytase which comprises an amino acid sequence which is
at least 94.66%; or at least 95.0, 95.5, 96, 96.5, 97, 97.5, 98,
98.5, 99, 99.5% similar to the sequence of amino acids 1-467 of
consensus phytase-10-thermo[3]-Q50T-K91A as shown in Fig. 8.

A phytase which is encoded by a DNA sequence which is at
least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99,
99.5% identical to nucleotides 12-1412 of the DNA sequence of
consensus phytase-10-thermo[3]-Q50T-K91A as shown in Fig. 8.

A DNA sequence which encodes a phytase and which (i) is at
least 95.88%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99,
99.5% identical; or (ii) hybridizes under low, or medium,
medium/high, high, or very high stringency conditions to
nucleotides 12-1412 of the DNA sequence of consensus
phytase-10-thermo[3]-Q50T-K91A as shown in Fig. 8. A suitable
negative control is DNA encoding consensus phytase-1. A suitable
positive control is DNA encoding any of CP10,
CP10-thermo[3]-Q50T-K91A, CP1-thermo[8], or
CP1-thermo[8]-Q50T-K91A.

A DNA sequence which encodes a phytase comprising an amino
acid sequence which is at least 93.37%; or at least 93.5, 94,
94.5, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5%
identical to the sequence of amino acids 1-467 of consensus
phytase-10-thermo[3]-Q50T-K91A as shown in Fig. 8.

(C) Phytases and corresponding DNA sequences related to
consensus phytase-1-thermo[8]

A phytase which comprises an amino acid sequence which is
at least 98.30%; or at least 98.5, 99, 99.5% identical to the
sequence of amino acids 1-467 of consensus phytase-1-thermo[8]
(as shown in Fig. 7; backmutations T50Q and A91K to be added).

A phytase which comprises an amino acid sequence which is
at least 98.51%; or at least 99, 99.5% similar to the sequence
of amino acids 1-467 of consensus phytase-1-thermo[8] (as shown
in Fig. 7; backmutations T50Q and A91K to be added).

A phytase which is encoded by a DNA sequence which is at
least 98.73%; or at least 99, 99.5% identical to nucleotides
1-1407 of the DNA sequence of consensus phytase-1-thermo[8] (as
shown in Fig. 7; backmutations T50Q and A91K to be added).

A DNA sequence which encodes a phytase and which (i) is at
least 98.73%; or at least 99, 99.5% identical; or (ii)
hybridizes under low, or medium, medium/high, high, or very high
stringency conditions to nucleotides 1-1407 of the DNA sequence
of consensus phytase-1-thermo[8] (as shown in Fig. 7;
backmutations T50Q and A91K to be added). A suitable negative
control is DNA encoding consensus phytase-1. A suitable positive
control is DNA encoding any of CP1-thermo[8] or
CP1-thermo[8]-Q50T-K91A.

A DNA sequence which encodes a phytase comprising an amino
acid sequence which is at least 98.30%; or at least 98.5, 99,
99.5% identical to the sequence of amino acids 1-467 of
consensus phytase-1-thermo[8] (as shown in Fig. 7; backmutations
T50Q and A91K to be added).

(D) Phytases and corresponding DNA sequences related to
consensus phytase-1-thermo[8]-Q50T-K91A

A phytase which comprises an amino acid sequence which is
at least 97.87%; or at least 98, 98.5, 99, 99.5% identical to
the sequence of amino acids 1-467 of consensus
phytase-1-thermo[8]-Q50T-K91A as shown in Fig. 7.

A phytase which comprises an amino acid sequence which is
at least 98.08%; or at least 98.5, 99, 99.5% similar to the
sequence of amino acids 1-467 of consensus
phytase-1-thermo[8]-Q50T-K91A as shown in Fig. 7.

A phytase which is encoded by a DNA sequence which is at
least 98.37%; or at least 98.5, 99, 99.5% identical to
nucleotides 1-1407 of the DNA sequence of consensus
phytase-1-thermo[8]-Q50T-K91A as shown in Fig. 7.

A DNA sequence which encodes a phytase and which (i) is at
least 98.37%; or at least 98.5, 99, 99.5% identical; or (ii)
hybridizes under low, or medium, medium/high, high, or very high
stringency conditions to nucleotides 1-1407 of the DNA sequence
of consensus phytase-1-thermo[8]-Q50T-K91A as shown in Fig. 7. A
suitable negative control is DNA encoding consensus phytase-1. A
suitable positive control is DNA encoding any of CP1-thermo[8]
or CP1-thermo[8]-Q50T-K91A.

A DNA sequence which encodes a phytase comprising an amino
acid sequence which is at least 97.87%; or at least 98, 98.5,
99, 99.5% identical to the sequence of amino acids 1-467 of
consensus phytase-1-thermo[8]-Q50T-K91A as shown in Fig. 7.

(E) Phytases and corresponding DNA sequences related to
consensus phytase-11

A phytase that comprises an amino acid sequence which is
at least 90.71%; or at least 91, 91.5, 92, 92.5, 93, 93.5, 94,
94.5, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5%
identical to the sequence of amino acids 1-482 of consensus
phytase-11 as shown in Fig. 6.

A phytase that comprises an amino acid sequence which is
at least 92.07%; or at least 92.5, 93, 93.5, 94, 94.5, 95, 95.5,
96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% similar to the sequence
of amino acids 1-482 of consensus phytase-11 as shown in Fig. 6.

A DNA sequence that encodes a phytase comprising an amino
acid sequence which is at least 90.71%; or at least 91, 91.5,
92, 92.5, 93, 93.5, 94, 94.5, 95, 95.5, 96, 96.5, 97, 97.5, 98,
98.5, 99, 99.5% identical to the sequence of amino acids 1-482
of consensus phytase-11 as shown in Fig. 6.

(F) Phytases and corresponding DNA sequences related to A.
fumigatus alpha-mutant

A phytase that comprises an amino acid sequence which is
at least 97.17%; or at least 97.5, 98, 98.5, 99, 99.5% identical
to the sequence of amino acids 1-467 of A. fumigatus
alpha-mutant (phytase) as shown in Fig. 9.

A phytase that comprises an amino acid sequence that is at
least 97.82%; or at least 98, 98.5, 99, 99.5% similar to the
sequence of amino acids 1-467 of A. fumigatus alpha-mutant
(phytase) as shown in Fig. 9.

A phytase which is encoded by a DNA sequence which is at
least 96.13%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5%
identical to nucleotides 1-1401 of the DNA sequence of A.
fumigatus ATCC 13073 alpha-mutant (phytase) as shown in Fig. 9.

A DNA sequence which encodes a phytase comprising an amino
acid sequence which is at least 97.17%; or at least 97.5, 98,
98.5, 99, 99.5% identical to the sequence of amino acids 1-467
of A. fumigatus ATCC 13073 alpha-mutant (phytase) as shown in
Fig. 9.

A DNA sequence which encodes a phytase and which (i) is at
least 96.13%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5%
identical; or (ii) hybridizes under low, or medium, medium/high,
high, or very high stringency conditions to nucleotides 1-1401
of the DNA sequence of A. fumigatus ATCC 13073 alpha-mutant
(phytase) shown in Fig. 9. A suitable negative control is DNA
encoding A. fumigatus 13073 phytase. A suitable positive control
is DNA encoding any one of the A. fumigatus ATCC 13073
alpha-mutant phytase or the optimized alpha-mutant.

(G) Phytases and corresponding DNA sequences related to the
optimized A. fumigatus alpha-mutant

A phytase that comprises an amino acid sequence that is at
least 96.08%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5%
identical to the sequence of the phytase of the optimized A.
fumigatus alpha-mutant.

A phytase that comprises an amino acid sequence that is at
least 96.74%; or at least 97, 97.5, 98, 98.5, 99, 99.5% similar
to the sequence of the phytase of the optimized A. fumigatus
alpha-mutant.

A phytase which is encoded by a DNA sequence which is at
least 95.63%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99,
99.5% identical to nucleotides 1-1401 of the DNA sequence
encoding the optimized A. fumigatus alpha-mutant phytase.

A DNA sequence that encodes a phytase comprising an amino
acid sequence that is at least 96.08%; or at least 96.5, 97,
97.5, 98, 98.5, 99, 99.5% identical to the optimized A.
fumigatus alpha-mutant phytase.

A DNA sequence which encodes a phytase and which (i) is at
least 95.63%; or at least 96, 96.5, 97, 97.5, 98, 98.5, 99,
99.5% identical; or (ii) hybridizes under low, or medium,
medium/high, high, or very high stringency conditions to
nucleotides 1-1401 of the DNA sequence encoding the optimized A.
fumigatus alpha-mutant phytase.

A suitable negative control is DNA encoding A. fumigatus
ATCC 13073 phytase. A suitable positive control is DNA encoding
any one of the A. fumigatus ATCC 13073 alpha-mutant phytase or
the optimized alpha-mutant.

(H) Phytases and corresponding DNA sequences related to
consensus phytase-7

A phytase that comprises an amino acid sequence which is
at least 94.87%; or at least 95, 95.5, 96, 96.5, 97, 97.5, 98,
98.5, 99, 99.5% identical to the sequence of amino acids 1-467
of consensus phytase-7 as shown in Fig. 10.

A phytase that comprises an amino acid sequence which is
at least 95.30%; or at least 95.5, 96, 96.5, 97, 97.5, 98, 98.5,
99, 99.5% similar to the sequence of amino acids 1-467 of
consensus phytase-7 as shown in Fig. 10.

A phytase which is encoded by a DNA sequence which is at
least 96.38%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5%
identical to nucleotides 12-1412 of the DNA sequence of
consensus phytase-7 as shown in Fig. 10.

A DNA sequence which encodes a phytase and which (i) is at
least 96.38%; or at least 96.5, 97, 97.5, 98, 98.5, 99, 99.5%
identical; or (ii) hybridizes under low, or medium, medium/high,
high, or very high stringency conditions to nucleotides 12-1412
of the DNA sequence of consensus phytase-7 as shown in Fig. 10.

A DNA sequence which encodes a phytase comprising an amino
acid sequence which is at least 94.87%; or at least 95, 95.5,
96, 96.5, 97, 97.5, 98, 98.5, 99, 99.5% identical to the
sequence of amino acids 1-467 of consensus phytase-7 as shown in
Fig. 10.

(I) Phytases related to basidio consensus phytase

A phytase which comprises an amino acid sequence which is
at least 76.23%; or at least 77, 78, 79, 80, 81, 82, 83, 84, 85,
86, 87, 88, 89, 90, 91, 92, 93, 94, 94.5, 95, 95.5, 96, 96.5,
97, 97.5, 98, 98.5, 99, 99.5% identical to the combined sequence
of (i) amino acids 1-441 of basidio consensus phytase shown in
Fig. 3, and (ii) amino acids 1-26 shown in Fig. 5 (the sequence
of (ii) to be added at the N-terminus of the sequence of (i)).

A phytase that comprises an amino acid sequence which is
at least 79.50%; or at least 80, 81, 82, 83, 84, 85, 86, 87, 88,
89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5,
99, 99.5% similar to the sequence of amino acids 1-441 of
basidio consensus phytase as shown in Fig. 3.

(J) Phytases related to consensus phytase-12

A phytase which comprises an amino acid sequence which is
at least 70, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87,
88, 89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 98,
98.5, 99, 99.5% identical to the sequence of amino acids 1-467
of consensus phytase-12 as shown in Fig. 21.

A phytase which comprises an amino acid sequence which is
at least 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88,
89, 90, 91, 92, 93, 94, 95, 95.5, 96, 96.5, 97, 97.5, 98, 98.5,
99, 99.5% similar to the sequence of amino acids 1-467 of
consensus phytase-12 as shown in Fig. 21.

CLAIMS

1. A phytase that comprises an amino acid sequence which is at
least 93.80% identical to the sequence of amino acids 1-467 of
consensus phytase-10 (SEQ ID NO: 26).

2. A phytase that is encoded by a DNA sequence that is at least
95.88% identical to nucleotides 12-1412 of the DNA sequence of
consensus phytase-10 (SEQ ID NO: 25).

3. A phytase that comprises
an amino acid sequence chosen from
    (i) SEQ ID NO: 26, or amino acids 1-438 thereof; or
an amino acid sequence encoded by
    (ii) nucleotides 12-1412, or 90-1412 of SEQ ID NO: 25.

4. A phytase that comprises
an amino acid sequence chosen from
    (i) consensus phytase-10-thermo[3],
    (ii) variants of (i), further including the mutations
        Q50T, K91A, or (Q50T+K91A), the latter variant being
        shown in Fig. 8,
    (iii) amino acids 27-467 of any of the sequences of (i)
        and (ii),
    (iv) SEQ ID NO: 31, or amino acids 1-441 thereof; or
an amino acid sequence encoded by
    (v) nucleotides 1-1401, or 79-1401 of SEQ ID NO: 30.

5. A phytase that comprises
an amino acid sequence chosen from
    (i) consensus phytase-1-thermo[8],
    (ii) variants of (i), further including the mutations
        Q50T, K91A, or (Q50T+K91A), the latter variant being
        shown in Fig. 7,
    (iii) amino acids 27-467 of any of the sequences of (i)
        and (ii), or
    (iv) SEQ ID NO: 29, or amino acids 1-441 thereof; or
an amino acid sequence encoded by
    (v) nucleotides 1-1407, or 79-1407 of SEQ ID NO: 28.

6. A phytase that comprises the amino acid sequence of
consensus phytase-11 (SEQ ID NO: 27).

7. A DNA sequence that comprises a DNA sequence encoding a
phytase of any one of claims 1-6.

8. A DNA sequence that comprises a DNA sequence encoding a
phytase, and wherein the phytase-encoding DNA sequence is (i) at
least 95.88% identical, or (ii) hybridizes under high stringency
conditions, to nucleotides 12-1412 of the DNA sequence of
consensus phytase-10 (SEQ ID NO: 25).

9. A DNA sequence that comprises a DNA sequence that encodes a
phytase, wherein the phytase comprises an amino acid sequence
that is at least 93.80% identical to the sequence of amino acids
1-467 of consensus phytase-10 (SEQ ID NO: 26).

10. A DNA sequence that comprises a DNA sequence that encodes a
phytase, and wherein the phytase-encoding DNA sequence comprises
    (i) nucleotides 12-1412, or 90-1412 of the DNA
        sequence of consensus phytase-10 (SEQ ID NO: 25);
    (ii) nucleotides 1-1401, or 79-1401 of the DNA
        sequence of consensus phytase-10-thermo[3]-Q50T-K91A
        (SEQ ID NO: 30); or
    (iii) nucleotides 1-1401, or 79-1401 of the DNA
        sequence of consensus phytase-1-thermo[8]-Q50T-K91A
        (SEQ ID NO: 28).

11. A vector comprising a DNA sequence according to any one of 
claims 7-10. 

12. A microbial host cell comprising a DNA sequence according 
to any one of claims 7-10, or the vector according to claim 11. 

13. A process for producing a phytase, the process comprising 
culturing the host cell according to claim 12 under conditions 
permitting the production of the phytase, and recovering the 
phytase from the culture broth. 



14. A food, feed or pharmaceutical composition comprising a
phytase of any one of claims 1-6.

AMENDED CLAIMS

[received by the International Bureau on 16 June 2000 (16.06.00); 
original claims 1 to 14 replaced by new claims 1 to 14 (3 pages)] 

1. A phytase that comprises an amino acid sequence which is at
least 93.80% identical to the sequence of amino acids -26 to
+441 of consensus phytase 10 (SEQ ID NO: 26).

2. A phytase that is encoded by a DNA sequence that is at
least 95.88% identical to nucleotides 12-1412 of the DNA
sequence of consensus phytase 10 (SEQ ID NO: 25).

3. A phytase that comprises
an amino acid sequence chosen from
    (i) SEQ ID NO: 26, or amino acids 1-438 thereof; or
an amino acid sequence encoded by
    (ii) nucleotides 12-1412, or 90-1412 of SEQ ID NO: 25.

4. A phytase that comprises
an amino acid sequence chosen from
    (i) consensus phytase-10-thermo[3],
    (ii) variants of (i), further including the mutations
        Q50T, K91A, or (Q50T+K91A), the latter variant
        being shown in Fig. 8,
    (iii) amino acids 27-467 of any of the sequences of (i)
        and (ii),
    (iv) SEQ ID NO: 31, or amino acids 1-441 thereof; or
an amino acid sequence encoded by
    (v) nucleotides 1-1401, or 79-1401 of SEQ ID NO: 30.

5. A phytase that comprises
an amino acid sequence chosen from
    (i) consensus phytase-1-thermo[8],
    (ii) variants of (i), further including the mutations
        Q50T, K91A, or (Q50T+K91A), the latter variant
        being shown in Fig. 7,

AMENDED SHEET (ARTICLE 19)

    (iii) amino acids 27-467 of any of the sequences of (i)
        and (iv), or
    (iv) SEQ ID NO: 29, or amino acids 1-441 thereof; or
an amino acid sequence encoded by
    (v) nucleotides 1-1401, or 79-1401 of SEQ ID NO: 28.

6. A phytase that comprises the amino acid sequence of
consensus phytase-11 (SEQ ID NO: 27).

7. A DNA sequence that comprises a DNA sequence encoding the
phytase of any one of claims 1-6.

8. A DNA sequence that comprises a DNA sequence encoding a
phytase, and wherein the phytase-encoding DNA sequence is (i)
at least 95.88% identical, or (ii) hybridizes under high
stringency conditions, to nucleotides 12-1412 of the DNA
sequence of consensus phytase 10 (SEQ ID NO: 25).

9. A DNA sequence that comprises a DNA sequence that encodes a
phytase, wherein the phytase comprises an amino acid sequence
that is at least 93.80% identical to the sequence of amino
acids -26 to +441 of consensus phytase 10 (SEQ ID NO: 26).

10. A DNA sequence that comprises a DNA sequence that encodes
a phytase, and wherein the phytase-encoding DNA sequence
comprises
    (i) nucleotides 12-1412, or 90-1412 of the DNA sequence
        of consensus phytase 10 (SEQ ID NO: 25);
    (ii) nucleotides 1-1401, or 79-1401 of the DNA sequence
        of consensus phytase-10-thermo[3]-Q50T,K91A (SEQ
        ID NO: 30); or
    (iii) nucleotides 1-1401, or 79-1401 of the DNA sequence
        of consensus phytase-1-thermo[8]-Q50T,K91A (SEQ ID
        NO: 28).

11. A vector comprising the DNA sequence according to any one
of claims 7-10.

12. A microbial host cell comprising the DNA sequence
according to any one of claims 7-10, or the vector according to
claim 6.

13. A process for producing a phytase, the process comprising
culturing the host cell according to claim 12 under conditions
permitting the production of the phytase, and recovering the
phytase from the culture broth.

14. A food, feed or pharmaceutical composition comprising the
phytase of any one of claims 1-6.

                                                                           50
A. terreus 9A-1        KhsDCNSVDh GYQCFPELSH kWGlYAPYFS LQDESPFPlD VPEDChlTFV
A. terreus cbs         NhsDCTSVDr GYQCFPELSH kWGlYAPYFS LQDESPFPlD VPDDChlTFV
A. niger var. awamori  NqsTCDTVDQ GYQCFSETSH LWGQYAPFFS LANESAISPD VPAGCrVTFA
A. niger T213          NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS LANESVISPD VPAGCrVTFA
A. niger NRRL3135      NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS LANESVISPE VPAGCrVTFA
A. fumigatus 13073     GSkSCDTVDl GYQCsPATSH LWGQYSPFFS LEDElSVSSK LPKDCrITLV
A. fumigatus 32722     GSkSCDTVDl GYQCsPATSH LWGQYSPFFS LEDElSVSSK LPKDCrlTLV
A. fumigatus 58128     GSkSCDTVDl GYQCsPATSH LWGQYSPFFS LEDElSVSSK LPKDCrITLV
A. fumigatus 26906     GSkSCDTVDl GYQCsPATSH LWGQYSPFFS LEDElSVSSK LPKDCrlTLV
A. fumigatus 32239     GSkACDTVEl GYQCsPGTSH LWGQYSPFFS LEDElSVSSD LPKDCrVTFV
E. nidulans            QNHSCNTADG GYQCFPNVSH VWGQYSPYFS IEQESAISeD VPHGCeVTFV
T. thermophilus        DSHSCNTVEG GYQCrPEISH sWGQYSPFFS LADQSEISPD VPQNCklTFV
M. thermophila         ESRPCDTpDl GFQCgTAISH FWGQYSPYFS VpSElDaS.. IPDDCeVTFA

Consensus              NSHSCDTVDG GYQCFPEISH LWGQYSPYFS LEDESAISPD VPDDC-VTFV
Consensus phytase      NSHSCDTVDG GYQCFPEISH LWGQYSPYFS LEDESAISPD VPDDCRVTFV



                       51                                                 100
A. terreus 9A-1        QVLARHGARs PThSKtKAYA AtlAAIQKSA TaFpGKYAFL QSYNYSLDSE
A. terreus cbs         QVLARHGARs PTDSKtKAYA AtlAAIQKNA TaLpGKYAFL KSYNYSMGSE
A. niger var. awamori  QVLSRHGARY PTESKgKkYS ALIEEIQQNV TtFDGKYAFL KTYNYSLGAD
A. niger T213          QVLSRHGARY PTESKgKkYS ALIEEIQQNV TtFDGKYAFL KTYNYSLGAD
A. niger NRRL3135      QVLSRHGARY PTDSKgKkYS ALIEEIQQNA TtFDGKYAFL KTYNYSLGAD
A. fumigatus 13073     QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAFL KTYNYTLGAD
A. fumigatus 32722     QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAFL KTYNYTLGAD
A. fumigatus 58128     QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAFL KTYNYTLGAD
A. fumigatus 26906     QVLSRHGARY PTSSKsKkYK kLVTAIQaNA TdFKGKFAFL KTYNYTLGAD
A. fumigatus 32239     QVLSRHGARY PTASKsKkYK kLVTAIQKNA TeFKGKFAFL ETYNYTLGAD
E. nidulans            QVLSRHGARY PTESKsKAYS GLIEAIQKNA TsFwGQYAFL ESYNYTLGAD
T. thermophilus        QLLSRHGARY PTSSKtElYS QLISrIQKTA TaYKGyYAFL KDYrYqLGAN
M. thermophila         QVLSRHGARa PTlKRaaSYv DLIDrlHhGA IsYgPgYEFL RTYDYTLGAD

Consensus              QVLSRHGARY PTSSK-KAYS ALIEAIQKNA T-FKGKYAFL KTYNYTLGAD
Consensus phytase      QVLSRHGARY PTSSKSKAYS ALIEAIQKNA TAFKGKYAFL KTYNYTLGAD



                                                                          150
A. terreus 9A-1        ELTPFGrNQL rDlGaQFYeR YNALTRhlnP FVRATDASRV hESAEKFVEG
A. terreus cbs         NLTPFGrNQL qDlGaQFYRR YDTLTRhlnP FVRAADSSRV hESAEKFVEG
A. niger var. awamori  DLTPFGEQEL VNSGIKFYQR YESLTRNIIP FIRSSGSSRV IASGEKFIEG
A. niger T213          DLTPFGEQEL VNSGIKFYQR YESLTRNIIP FIRSSGSSRV IASGEKFIEG
A. niger NRRL3135      DLTPFGEQEL VNSGIKFYQR YESLTRNIVP FIRSSGSSRV IASGKKFIEG
A. fumigatus 13073     DLTPFGEQQL VNSGIKFYQR YKALARSWP  FIRASGSDRV IASGEKFIEG
A. fumigatus 32722     DLTPFGEQQL VNSGIKFYQR YKALARSWP  FIRASGSDRV IASGEKFIEG
A. fumigatus 58128     DLTPFGEQQL VNSGIKFYQR YKALARSWP  FIRASGSDRV IASGEKFIEG
A. fumigatus 26906     DLTAFGEQQL VNSGIKFYQR YKALARSWP  FIRASGSDRV IASGEKFIEG
A. fumigatus 32239     DLTPFGEQQM VNSGIKFYQK YKALAgSWP  FIRSSGSDRV IASGEKFIEG
E. nidulans            DLTiFGENQM VDSGaKFYRR YKNLARKnTP FIRASGSDRV VASAEKFING
T. thermophilus        DLTPFGENQM IQlGIKFYnH YKSLARNaVP FVRCSGSDRV IASGrlFIEG
M. thermophila         ELTRtGQQQM VNSGIKFYRR YRALARKSIP FVRTAGqDRV VhSAENFTQG

Consensus              DLTPFGENQM VNSGIKFYRR YKALARK-VP FVRASGSDRV IASAEKFIEG
Consensus phytase      DLTPFGENQM VNSGIKFYRR YKALARKIVP FIRASGSDRV IASAEKFIEG

                       151                                                200
A. terreus 9A-1        FQTARqDDHh ANpHQPSPrV DValPEGSAY NNTLEHSlCT AFES...STV
A. terreus cbs         FQNARqGDPh ANpHQPSPrV DWIPEGTAY  NNTLEHSICT AFEA...STV
A. niger var. awamori  FQSTKLkDPr AqpgQSSPkl DWISEASSs  NNTLDPGTCT VFED...SEL
A. niger T213          FQSTKLkDPr AqpgQSSPkl DWISEASSs  NNTLDPGTCT VFED...SEL
A. niger NRRL3135      FQSTKLkDPr AqpgQSSPkl DWISEASSs  NNTLDPGTCT VFED...SEL
A. fumigatus 13073     FQqAKLADPG A.TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA...SQL
A. fumigatus 32722     FQqAKLADPG A.TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA...SQL
A. fumigatus 58128     FQqAKLADPG A.TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA...SQL
A. fumigatus 26906     FQqAKLADPG A.TNRAAPAI SVIIPESETF NNTLDHGVCT kFEA...SQL
A. fumigatus 32239     FQqANVADPG A.TNRAAPVI SVIIPESETY NNTLDHSVCT NFEA...SEL
E. nidulans            FRKAQLhDHG S..gQATPW  NVIIPEiDGF NNTLDHSTCV SFEN...DEr
T. thermophilus        FQSAKVlDPh SDkHDAPPTI NVIIeEGPSY NNTLDtGSCP VFED...SSg
M. thermophila         FHSAlLADRG STvRPTlPyd mWIPETAGa  NNTLHNDlCT AFEEgpySTI

Consensus              FQSAKLADPG S-PHQASPVI NVIIPEGSGY NNTLDHGTCT AFED   SEL
Consensus phytase      FQSAKLADPG SQPHQASPVT DVIIPEGSGY NNTLDHGTCT AFED...SEL


A. terreus 9A-1        GDDAvANFTA VFAPAIaQRL EADLPGVqLS TDDWnLMAM  CPFETVSlTD
A. terreus cbs         GDAAADNFTA VFAPAIakRL EADLPGVqLS ADDWnLMAM  CPFETVSlTD
A. niger var. awamori  ADTVEANFTA TFAPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST
A. niger T213          ADTVEANFTA TFAPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST
A. niger NRRL3135      ADTVEANFTA TFVPSIRQRL ENDLSGVTLT DTEVTyLMDM CSFDTIStST
A. fumigatus 13073     GDEVAANFTA lFAPDIRARa EkHLPGVTLT DEDWsLMDM  CSFDTVARTS
A. fumigatus 32722     GDEVAANFTA lFAPDIRARa EkHLPGVTLT DEDWsLMDM  CSFDTVARTS
A. fumigatus 58128     GDEVAANFTA lFAPDIRARa EkHLPGVTLT DEDWsLMDM  CSFDTVARTS
A. fumigatus 26906     GDEVAANFTA lFAPDIRARa KkHLPGVTLT DEDWsLMDM  CSFDTVARTS
A. fumigatus 32239     GDEVEANFTA lFAPAIRARI EkHLPGVqLT DDDWsLMDM  CSFDTVARTA
E. nidulans            ADEiEANFTA IMGPPIRkRL ENDLPGIKLT NENVIyLMDM CSFDTMARTA
T. thermophilus        GHDAQEKFAk qFAPAIlEKI KDHLPGVDLA vSDVpyLMDL CPFETLARNh
M. thermophila         GDDAQDTYlS TFAGPItARV NANLPGANLT DADTVaLMDL CPFETVAsSS

Consensus              GDDAEANFTA TFAPAIRARL EADLPGVTLT DEDW-LMDM  CPFETVARTS
Consensus phytase      GDDVEANFTA LFAPAIRARL EADLPGVTLT DEDWYLMDM  CPFETVARTS

                       251                                                300
A. terreus 9A-1                    DAhTLSPFC DLFTAtEWtq YNYLlSLDKY YGYGGGNPLG
A. terreus cbs                     DAhTLSPFC DLFTAaEWtq YNYLlSLDKY YGYGGGNPLG
A. niger var. awamori              vDTKLSPFC DLFTHdEWih YDYLQSLkKY YGHGAGNPLG
A. niger T213                      vDTKLSPFC DLFTHdEWih YDYLRSLkKY YGHGAGNPLG
A. niger NRRL3135                  vDTKLSPFC DLFTHdEWin YDYLQSLkKY YGHGAGNPLG
A. fumigatus 13073                 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG
A. fumigatus 32722                 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG
A. fumigatus 58128                 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG
A. fumigatus 26906                 DASQLSPFC QLFTHnEWkk YNYLQSLGKY YGYGAGNPLG
A. fumigatus 32239                 DASELSPFC AIFTHnEWkk YDYLQSLGKY YGYGAGNPLG
E. nidulans                        HGTELSPFC AIFTEkEWlq YDYLQSLSKY YGYGAGSPLG
T. thermophilus                    TDT.LSPFC ALsTQeEWqa YDYYQSLGKY YGnGGGNPLG
M. thermophila         sdpatadagg gNGrpLSPFC rLFSEsEWra YDYLQSVGKW YGYGPGNPLG

Consensus                         -DATELSPFC ALFTE-EW-- YDYLQSLGKY YGYGAGNPLG

Consensus phytase                  DATELSPFC ALFTHDEWRQ YDYLQSLGKY YGYGAGNPLG

                       301                                                350
A. terreus 9A-1        PVQGVGWaNE LMARLTRAPV HDHTCVNNTL DASPATFPLN ATLYADFSHD
A. terreus cbs         PVQGVGWaNE LIARLTRSPV HDHTCVNNTL DANPATFPLN ATLYADFSHD
A. niger var. awamori  PTQGVGYaNE LIARLTHSPV HDDTSSNHTL DSNPATFPLN STLYADFSHD
A. niger T213          PTQGVGYaNE LIARLTHSPV HDDTSSNHTL DSNPATFPLN STLYADFSHD
A. niger NRRL3135      PTQGVGYaNE LIARLTHSPV HDDTSSNHTL DSSPATFPLN STLYADFSHD
A. fumigatus 13073     PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD
A. fumigatus 32722     PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD
A. fumigatus 58128     PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD
A. fumigatus 26906     PAQGIGFtNE LIARLTRSPV QDHTSTNsTL vSNPATFPLN ATMYVDFSHD
A. fumigatus 32239     PAQGIGFtNE LIARLTNSPV QDHTSTNsTL DSDPATFPLN ATIYVDFSHD
E. nidulans            PAQGIGFtNE LIARLTQSPV QDNTSTNHTL DSNPATFPLD rKLYADFSHD
T. thermophilus        PAQGVGFvNE LIARMTHSPV QDYTTVNHTL DSNPATFPLN ATLYADFSHD
M. thermophila         PTQGVGFvNE LLARLAgvPV RDgTSTNRTL DGDPrTFPLG rPLYADFSHD

Consensus              PAQGVGF-NE LIARLTHSPV QDHTSTNHTL DSNPATFPLN ATLYADFSHD
Consensus phytase      PAQGVGFANE LIARLTRSPV QDHTSTNHTL DSNPATFPLN ATLYADFSHD



351 400

A. terreus 9A-1 SNLVSIFWAL GLYNGTAPLS gTSVESVSQT DGYAAAWTVP FAARAYVEMM

A. terreus cbs SNLVSIFWAL GLYNGTkPLS gTTVEDITrT DGYAAAWTVP FAARAYIEMM

A. niger var. awamori NGIISILFAL GLYNGTkPLS TTTVENITQT DGFSSAWTVP FASRlYVEMM

A. niger T213 NGIISILFAL GLYNGTkPLS TTTVENITQT DGFSSAWTVP FASRlYVEMM

A. niger NRRL3135 NGIISILFAL GLYNGTkPLS TTTVENITQT DGFSSAWTVP FASRlYVEMM

A. fumigatus 13073 NSMVSIFFAL GLYNGTEPLS rTSVESaKEl DGYSASWvVP FGARAYFEtM

A. fumigatus 32722 NSMVSIFFAL GLYNGTGPLS rTSVESaKEl DGYSASWvVP FGARAYFEtM

A. fumigatus 58128 NSMVSIFFAL GLYNGTEPLS rTSVESaKEl DGYSASWvVP FGARAYFEtM

A. fumigatus 26906 NSMVSIFFAL GLYNGTEPLS rTSVESaKEl DGYSASWvVP FGARAYFEtM

A. fumigatus 32239 NGMIPIFFAM GLYNGTEPLS qTSeESTKES NGYSASWAVP FGARAYFEtM

E. nidulans NSMISIFFAM GLYNGTQPLS mDSVESIQEm DGYAASWTVP FGARAYFELM

T. thermophilus NTMTSIFaAL GLYNGTAkLS TTEIKSIEET DGYSAAWTVP FGGRAYIEMM

M. thermophila NDMMGVLgAL GaYDGVPPLD KTArrDpEEl GGYAASWAVP FAARiYVEKM

Consensus NSMISIFFAL GLYNGTAPLS TTSVESIEET DGYAASWTVP FGARAYVEMM

Consensus phytase NSMISIFFAL GLYNGTAPLS TTSVESIEET DGYSASWTVP FGARAYVEMM



401 450 

A. terreus 9A-1 QC RAEKE PLVRVLVNDR VMPLHGCPTD KLGRCKrDAF 

A. terreus cbs QC RAEKQ PLVRVLVNDR VMPLHGCAVD NLGRCKrDDF 

A. niger var. awamori QC . .QAEQE PLVRVLVNDR WPLHGCPID aLGRCTrDSF 

A. niger T213 QC QAEQE PLVRVLVNDR WPLHGCPID aLGRCTrDSF 

A. niger NRRL3135 QC QAEQE PLVRVLVNDR WPLHGCPVD aLGRCTrDSF 

A. fumigatus 13073 QC KSEKE PLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 32722 QC KSEKE PLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 58128 QC KSEKE SLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 26906 QC KSEKE PLVRALINDR WPLHGCDVD KLGRCKLNDF 

A. fumigatus 32239 QC KSEKE PLVRALINDR WPLHGCAVD KLGRCKLKDF

E. nidulans QC E.KKE PLVRVLVNDR WPLHGCAVD KFGRCTLDDW 

T. thermophilus QC DDSDE PWRVLVNDR WPLHGCEVD SLGRCKrDDF 

M. thermophila RCsggggggg ggegrQEKDE eMVRVLVNDR VMTLkGCGAD ErGMCTLErF 

Consensus QC QAEKE PLVRVLVNDR WPLHGCAVD KLGRCKLDDF 

Consensus phytase QC QAEKE PLVRVLVNDR WPLHGCAVD KLGRCKRDDF 
451 471

A. terreus 9A-1 VAGLSFAQAG GNWADCF

A. terreus cbs VEGLSFARAG GNWAECF

A. niger var. awamori VrGLSFARSG GDWAECsA

A. niger T213 VrGLSFARSG GDWAECFA

A. niger NRRL3135 VrGLSFARSG GDWAECFA

A. fumigatus 13073 VKGLSWARSG GNWGECFS

A. fumigatus 32722 VKGLSWARSG GNWGECFS

A. fumigatus 58128 VKGLSWARSG GNWGECFS

A. fumigatus 26906 VKGLSWARSG GNWGECFS

A. fumigatus 32239 VKGLSWARSG GNSEQSFS

E. nidulans VEGLNFARSG GNWkTCFTl

T. thermophilus VrGLSFARqG GNWEGCYAas e

M. thermophila IESMAFARGN GKWDlCFA

Consensus VEGLSFARSG GNWAECFA-- -

Consensus phytase VEGLSFARSG GNWAECFA.. .
CP-1

Eco RI MGVFVVLLSIATLFGST
TATATGAATTCATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

1 + + + + + + 60 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHSCDTVDGG
CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG

61 + + + + + + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3 

YQCFPEISHLWGQYSPYFSL 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

EDESAISPDVPDDCRVTFVQ 
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 

181 + + + + + + 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4 

CP-5 

VLSRHGARYPTSSKS KAYSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTG 

241 + + + + + + 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGAC 

LIEAIQKNATAFKGKYAFLK

CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

361 + + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGIKFYRRYKALARKIVPF 

TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8 

CP-9

IRASGSDRVIASAEKFIEGF 
TCATTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 + + + + + + 540 

AGTAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGSQPHQASPVID

TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG

541 + + + + + + 600

AGGTTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 
CP-10

CP-11

VIIPEGSGYNNTLDHGTCTA

ACGTTATTATTCCAGAAGGaTCcGGTTACAACAACACTTTGGACCACGGTACTTGTACTG 
601 + + + + + + 660 

TGCAATAATAAGGTCTTCCtAGgGGAATGTTGTTGTGAAACCTGGTGCCATGAACATGAC 

FEDSELGDDVEANFTALFAP

CTTTCGAAGACTCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTGAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP-12 

AIRARLEADLPGVTLTDEDV 

CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACG 
721 + + + + + + 780 

GTCGATAATCTCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGC 

CP-13 

VYLMDMCPFETVARTSDATE 
TTGTTTACTTGATGGACATGTGTCCATTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

781 + + + + + + 840 

AACAAATGAACTACCTGTACACAGGTAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRQYDYLQ 

AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGACAATACGACTACTTGC 
841 + + + + + + 900 

TTAACAGAGGTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTGTTATGCTGATGAACG 

CP-14 

CP-15 

SLGKYYGYGAGNPLGPAQGV 

AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 

901 + + + + + + 960

TTAGAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

GFANELIARLTRSPVQDHTS
TTGGTTTCGCTAACGAATTGATTGCTAGATTGACTAGATCTCCAGTTCAAGACCACACTT

961 + + + + + + 1020

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 

CP-16 

CP-17 

TNHTLDSNPATFPLNATLYA
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG

1021 + + + + + + 1080

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNSMISIFFALGLYNG 

CTGACTTCTCTCACGACAACTCTATGATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 
1081 + + + + + + 1140

GACTGAAGAGAGTGCTGTTGAGATACTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18 

CP-19 

TAPLSTTSVESIEETDGYSA 
GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTG 
1141 + + + + + + 1200 

CATGACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAC 
SWTVPFGARAYVEMMQCQAE
CTTCTTGGACTGTTCCATTCGGTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTG 

1201 + + + + + + 1260 

GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 
1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 



VDKLGRCKRDDFVEGLSFAR
CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* Eco RI
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 
1381 + + + + 1426 

CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 
1 50

P. involutus (phyA1) SvP.KnTAPt FPIPeseQrn WSPYSPYFPL AeYkAPPAGC QlnQVNIIQR

P. involutus (phyA2) SvP.RniAPK FSIPeseQrn WSPYSPYFPL AeYkAPPAGC EInQVNIIQR

T. pubescens hiPlRdTSAc LdVTrDvQqs WSmYSPYFPa AtYvAPPASC QlnQVHIIQR

A. pediades GgwQaTfvQ pfFPpQiQds WAAYTPYYPV qaYtPPPkDC KItQVNIIQR

P. lycii StQfsfvAAQ LPIPaQntsn WGPYdPFFPV EpYaAPPEGC tVtQVNLIQR

Basidio S-P-R-TAAQ LPIP-Q-Q-- WSPYSPYPPV A-Y-APPAGC QI-QVNIIQR 



51 100 

P. involutus (phyA1) HGARFPTSGA TTRIKAGLTK LQGvqnfTDA KFNFIkSfkY dLGnsDLVPF

P. involutus (phyA2) HGARFPTSGA ATRIKAGLSK LQSvqnfTDP KFDFIkSfTY dLGtsDLVPF

T. pubescens HGARFPTSGA AkRIQTAVAK LKAAsnyTDP lLAFVtNyTY sLGqDsLVeL

A. pediades HGARFPTSGA GTRIQAAVkK LQSAktyTDP RLDFLtNyTY tLGhDDLVPF

P. lycii HGARWPTSGA rSRqvAAVAK IQmArpfTDP KYEFLnDfvY kFGvADLLPF

Basidio HGARFPTSGA ATRIQAAVAK LQSA TDP KLDFL-N-TY -LG-DDLVFF 



101 150 

P. involutus (phyA1) GAaQSfDAGQ EAFARYSkLV SkNNLPFIRA dGSDRWDSA TNWTAGFAsA

P. involutus (phyA2) GAaQSfDAGl EvFARYSkLV SsDNLPFIRS dGSDRWDTA TNWTAGFAsA

T. pubescens GAtQSSEAGQ EAFTRYSsLV SaDELPFVRA SGSDRWATA nNWTAGFAlA

A. pediades GAlQSSQAGE ETFqRYSfLV SkENLPFVRA SSSNRWDSA TNWTEGFSaA

P. lycii GAnQShQTGt DmYTRYStLf egGDVPFVRA AGdQRWDSS TNWTAGFGdA 

Basidio GA-QSSQAGQ EAFTRYS-LV S-DNLPFVRA SGSDRWDSA TNWTAGFA-A 



151 200 

P. involutus (phyA1) ShNTvqPkLn LILPQtGNDT LEDNMCPaAG DSDPQvNaWL AVafPSITAR

P. involutus (phyA2) SrNAiqPkLd LILPQtGNDT LEDNMCPaAG ESDPQvDaWL AsafPSVTAQ

T. pubescens SsNSitPvLs VIISEaGNDT LDDNMCPaAG DSDPQvNqWL AqFAPPMTAR

A. pediades ShHvlnPiLf VILSEslNDT LDDaMCPnAG sSDPQtGiWt SIYGTPIAnR

P. lycii SgETvlPtLq WLqEeGNcT LcNNMCPnEv DGDest.tWL GVFAPnITAR

Basidio S-NT--P-L- VILSE-GNDT LDDNMCP-AG DSDPQ-N-WL AVFAPPITAR



201 250 

P. involutus (phyA1) LNAAAPSvNL TDtDAfNLvs LCAFlTVSkE kkSdFCtLFE giPGsFeAFa

P. involutus (phyA2) LNAAAPGANL TDaDAfNLvs LCPFmTVSkE qkSdFCtLFE giPGsFeAFa

T. pubescens LNAGAPGANL TDtDTyNLlt LCPFETVAtE rrSeFCDIYE elQAE.dAFa

A. pediades LNqqAPGANI TAaDvsNLip LCAFETIvkE tpSpFCNLF. .tPEEFaqFe

P. lycii LNAAAPSANL SDsDAltLmd MCPFDTLSsG naSpFCDLF. .tAEEYvSYe

Basidio LNAAAPGANL TD-DA-NL-- LCPFETVS-E --S-FCDLFE --PEEF-AF-
251 300

P. involutus (phyA1) YgGDLDKFYG TGYGQeLGPV QGVGYVNELI ARLTnsAVRD NTQTNRTLDA

P. involutus (phyA2) YaGDLDKFYG TGYGQALGPV QGVGYINELL ARLTnsAVnD NTQTNRTLDA

T. pubescens YnADLDKFYG TGYGQPLGPV QGVGYINELI ARLTaQnVsD HTQTNsTLDS

A. pediades YfGDLDKFYG TGYGQPLGPV QGVGYINELL ARLTemPVRD NTQTNRTLDS

P. lycii YyyDLDKYYG TGpGNALGPV QGVGYVNELL ARLTgQAVRD ETQTNRTLDS

Basidio Y-GDLDKFYG TGYGQPLGPV QGVGYINELL ARLT-QAVRD NTQTNRTLDS 



301 350 

P. involutus (phyA1) SPvTFPLNKT FYADFSHDNl MVAVFSAMGL FrQPAPLsTS vPNPwRTWrT

P. involutus (phyA2) APdTFPLNKT MYADFSHDNl MVAVFSAMGL FrQSAPLsTS tPDPNRTWLT

T. pubescens SPeTFPLNRT LYADFSHDNQ MVAIFSAMGL FNQSAPLDPT tPDPaRTFLv

A. pediades SPlTFPLDRS IYADLSHDNQ MIAIFSAMGL FNQSSPLDPS fPNPKRTWVT

P. lycii dPaTFPLNRT FYADFSHDNt MVPIFAALGL FNaTA.LDPl kPDeNRlWVd

Basidio SP-TFPLNRT FYADFSHDNQ MVAIFSAMGL FNQSAPLDPS -PDPNRTWVT 



351 400 

P. involutus (phyA1) SsLVPFSGRM WERLsC.f GT tkV RVLVQDqVQP

P. involutus (phyA2) SsWPFSARM aVERLsC . . a GT tkV RVLVQDqVQP 

T. pubescens kKIVPFSARM WERLdC. .g GA qsV RLLVNDAVQP 

A. pediades SRLtPFSARM VtERLlCqrd GTgsggpsri mrngnvqtfV RILVNDALQP 

P. lycii SKLVPFSGHM tVEKLaC sgkeaV RVLVNDAVQP 

Basidio SKLVPFSARM WERL-C GT V RVLVNDAVQP 



401 441 

P. involutus (phyA1) LEFCGGDrNG lCTLAkFVES QtFARsDGaG DFEKCFATSa

P. involutus (phyA2) LEFCGGDqDG lCALDkFVES QaYARsGGaG DFEKCLATTv

T. pubescens LAFCGADtsG vCTLDAFVES QaYARNDGEG DFEKCFAT

A. pediades LKFCGGDmDS lCTLEAFVES QkYAREDGQG DFEKCFD

P. lycii LEFCGG.vDG vCeLsAFVES QtYARENGQG DFAKCgfvPs e

Basidio LEFCGGD-DG -CTLDAFVES Q-YAREDGQG DFEKCFATP--



Fig. 3b 
1 50

A. terreus 9al KhsdCNSVDh GYQCfPELSH kWGlYAPYFS LqDESPFPlD VPeDCHITFV

A. terreus cbs NhsdCtSVDr GYQCfPELSH kWGlYAPYFS LqDESPFPlD VPdDCHITFV

A. niger var. awamori NqsTCDTVDq GYQCfSEtSH LWGQYAPFFS LANESAISPD VPaGCRVTFa

A. niger NRRL3135 NqsSCDTVDq GYQCfSEtSH LWGQYAPFFS LANESvISPE VPaGCRVTFa

A. fumigatus 13073 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPkDCRITLV 

A. fumigatus 32722 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPkDCRITLV

A. fumigatus 58128 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPkDCRITLV 

A. fumigatus 26906 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPkDCRITLV 

A. fumigatus 32239 GSkACDTVEl GYQCsPGtSH LWGQYSPFFS LEDElSVSSD LPkDCRVTFV 

E. nidulans QNHSCNTaDG GYQCfPNVSH VWGQYSPYFS IEQESAISeD VPhGCeVTFV

T. thermophilus DSHSCNTVEG GYQCrPEISH sWGQYSPFFS LADQSEISPD VPqNCKITFV

T. lanuginosus nvDIAR hWGQYSPFFS LAEvSEISPA VPkGCRVeFV

M. thermophila ESRPCDTpDl GFQCgTAISH FWGQYSPYFS VPsElDaS.. IPdDCeVTFa

Basidio xSxPxrxtAA qLPipxQxqx xWSPYSPYFP VAxyxA.... pPaGCQIxqV

Consensus NSHSCDTVDG GYQC-PEISH LWGQYSPFFS LADESAISPD VP-GCRVTFV

Fcp10 NSHSCDTVDG GYQCFPEISH LWGQYSPFFS LADESAISPD VPKGCRVTFV



51 100

A. terreus 9al QVLARHGARs PThSKTKaYA AtlaAIQKSA TaFpGKYAFL QSYNYSLDSE

A. terreus cbs QVLARHGARs PTdSKTKaYA AtlaAIQKNA TaLpGKYAFL KSYNYSMGSE

A. niger var. awamori QVLSRHGARY PTeSKGKKYS ALIeEIQQNv TtFDGKYAFL KTYNYSLGAD

A. niger NRRL3135 QVLSRHGARY PTdSKGKKYS ALIeEIQQNA TtFDGKYAFL KTYNYSLGAD

A. fumigatus 13073 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32722 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 58128 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 26906 QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32239 QVLSRHGARY PTASKSKKYk kLVtAIQKNA TeFKGKFAFL ETYNYTLGAD

E. nidulans QVLSRHGARY PTeSKSKaYS GLIeAIQKNA TsFwGQYAFL ESYNYTLGAD

T. thermophilus QLLSRHGARY PTSSKTElYS qLIsrIQKtA TaYKGyYAFL KdYrYqLGAN

T. lanuginosus QVLSRHGARY PTAhKSEvYA ELLqrIQDtA TeFKGDFAFL RdYaYhLGAD

M. thermophila QVLSRHGARa PTlkRAasYv DLIdrIHhGA isYgPgYEFL RTYDYTLGAD

Basidio NIIqRHGARF PTSGaAtRiq AaVakLQsax xxtDPKLDFL xnxtYxLGxD

Consensus QVLSRHGARY PTSSKSKKYS ALI-AIQKNA T-FKGKYAFL KTYNYTLGAD

Fcp10 QVLSRHGARY PTSSKSKKYS ALIEAIQKNA TAFKGKYAFL KTYNYTLGAD



101 150

A. terreus 9al ELTPFGrNQL rDlGaQFYeR YNAL.TRhln PFVRATDAsR VhESAEKFVE

A. terreus cbs NLTPFGrNQL qDlGaQFYRR YDTL.TRhln PFVRAADSsR VhESAEKFVE

A. niger var. awamori DLTPFGEQEL VNSGIKFYQR YESL.TRnll PFIRSSGSsR VIASGEKFIE

A. niger NRRL3135 DLTPFGEQEL VNSGIKFYQR YESL.TRnlV PFIRSSGSsR VIASGKKFIE

A. fumigatus 13073 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE

A. fumigatus 32722 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE

A. fumigatus 58128 DLTPFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE

A. fumigatus 26906 DLTAFGEQQL VNSGIKFYQR YKAL.ARsW PFIRASGSDR VIASGEKFIE

A. fumigatus 32239 DLTPFGEQQM VNSGIKFYQK YKAL.AgsW PFIRSSGSDR VIASGEKFIE

E. nidulans DLTiFGENQM VDSGaKFYRR YKnL.ARknt PFIRASGSDR WASAEKFIN

T. thermophilus DLTPFGENQM IQlGIKFYnH YKSL.ARnaV PFVRCSGSDR VIASGrlFIE

T. lanuginosus NLTRFGEEQM MESGrQFYHR YREq.ARelV PFVRAAGSAR VIASAEfFnr

M. thermophila ELTRtGQQQM VNSGIKFYRR YRAL.ARksI PFVRTAGqDR WhSAENFtQ

Basidio DLvPFGAxQs sQAGqEaFtR YsxLvSxdnL PFVRASGSDR WDSAtNWtA

Consensus DLTPFGEQQM VNSGIKFYRR YKAL-AR-IV PFVRASGSDR VIASAEKFIE

Fcp10 DLTPFGEQQM VNSGIKFYRR YKAL.ARKIV PFVRASGSDR VTASAEKFIE
151 200

A. terreus 9al GFQTARqDDh hAnphQPSPr VDValPEGsA YNNTLEHSLC TAFEs...St

A. terreus cbs GFQNARqGDP hAnphQPSPr VDWIPEGtA YNNTLEHSIC TAFEa...St

A. niger var. awamori GFQSTKLkDP rAqpgQSSPk IDWISEAsS sNNTLDpGtC TvFEd...SE

A. niger NRRL3135 GFQSTKLkDP rAqpgQSSPk IDWISEAsS sNNTLDpGtC TvFEd...SE

A. fumigatus 13073 GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SQ

A. fumigatus 32722 GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SQ

A. fumigatus 58128 GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SQ

A. fumigatus 26906 GFQqAKLADP gAt.nRAAPa ISVIIPESeT FNNTLDHGVC TkFEa...SQ

A. fumigatus 32239 GFQqANVADP gAt.nRAAPV ISVIIPESeT YNNTLDHSVC TnFEa...SE

E. nidulans GFRkAQLhDh g.s.gQATPV VNVIIPEidG FNNTLDHStC vSFEn...dE

T. thermophilus GFQSAKVlDP hSdkhDAPPt INVIIeEGpS YNNTLDtGsC PvFEd...Ss

T. lanuginosus GFQdAKdrDP rSnkdQAePV INVIISEEtG sNNTLDgltC PAaEe...Ap

M. thermophila GFHSAlLADR gStvrPTlPy dmWIPETaG aNNTLHNDLC TAFEegPySt

Basidio GFaxA sxntxxPx LxVILSExg. . NDTLDDNMC PxAG

Consensus GFQSAKLADP -A QASPV INVIIPEG-G YNNTLDHGLC TAFE- -P-SE

Fcp10 GFQSAKLADP GANPHQASPV INVIIPEGAG YNNTLDHGLC TAFEE...SE



201 250

A. terreus 9al VGDDavANFT AVFAPAIaqR LEAdLPGVQL StDDWNLMA MCPFETVSlT

A. terreus cbs VGDAaADNFT AVFAPAIakR LEAdLPGVQL SADDWNLMA MCPFETVSlT

A. niger var. awamori LADtVEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS

A. niger NRRL3135 LADtVEANFT AtFvPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS

A. fumigatus 13073 LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVArT

A. fumigatus 32722 LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVArT

A. fumigatus 58128 LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCSFDTVArT

A. fumigatus 26906 LGDEVAANFT ALFAPdIRAR aKkhLPGVtL TDEDWSLMD MCSFDTVArT

A. fumigatus 32239 LGDEVEANFT ALFAPAIRAR IEkhLPGVQL TDDDWSLMD MCSFDTVArT

E. nidulans rADEIEANFT AIMGPPIRkR LEndLPGIKL TNENVIyLMD MCSFDTMArT

T. thermophilus gGHDaQEKFA kqFAPAIlEK IKDhLPGVDL AvsDVpyLMD LCPFETLArn

T. lanuginosus .DptqpAEFl qVFGPRVlkK ItkhMPGVNL TlEDVplFMD LCPFDTVGsd

M. thermophila IGDDaQDtYl StFAGPItAR VNAnLPGaNL TDADtVaLMD LCPFETVAsS

Basidio dSDpqxnxWl AVFAPPItAR LNAaaPGaNL TDxDaxNLxx LCPFETVS..

Consensus LGDDVEANFT AVFAPPIRAR LEA-LPGVNL TDEDWNLMD MCPFDTVA-T

Fcp10 LGDDVEANFT AVFAPPIRAR LEAHLPGVNL TDEDWNLMD MCPFDTVART



251 300 

A. terreus 9al dD. .Aht LSPF CDLFTa . - tE WtQYNYLlSL dKYYGYGGGN 

A. terreus cbs dD. .Aht LSPF CDLFTa. .aE WtQYNYLlSL dKYYGYGGGN 

A. niger var. awamori Tv. .DTK. . . LSPF CDLFTH. .dE WiHYDYLQSL kKYYGHGAGN

A. niger NRRL3135 Tv. .DTK LSPF CDLFTH. .dE WiNYDYLQSL kKYYGHGAGN

A. fumigatus 13073 SD. -ASQ LSPF CQLFTH. .nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 32722 SD. .ASQ LSPF CQLFTH. .nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 58128 SD. .ASQ LSPF CQLFTH. .nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 26906 SD. .ASQ LSPF CQLFTH. .nE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus 32239 AD. .ASE LSPF CAIFTH. .nE WkKYDYLQSL gKYYGYGAGN

E. nidulans AH. .GTE LSPF CAIFTE. .kE WlQYDYLQSL sKYYGYGAGS

T. thermophilus ht..DT LSPF CALsTQ. .eE WqaYDYYQSL gKYYGnGGGN

T. lanuginosus PvlfPrQ LSPF CHLFTa. .dD Wma YDYYyTL dKYYSHGGGS 

M. thermophila SsdpATadag ggngrpLSPF CrLFSE..sE WraYDYLQSV gKWYGYGPGN 

Basidio xexxSxF CDLFexxpeE FxaFxYxgdL dKFYGtGyGQ 

Consensus SD--ATQ LSPF CDLFTH E W-QYDYLQSL - KYYGYGAGN 

Fcp10 SD..ATQ LSPF CDLFTH. .DE WIQYDYLQSL GKYYGYGAGN



Fig. 4b 
301 350

A. terreus 9al PLGPvQGVGW aNELMARLTR A.PVHDHTCv NNTLDASPAT FPLNATLYAD

A. terreus cbs PLGPvQGVGW aNELIARLTR S.PVHDHTCv NNTLDANPAT FPLNATLYAD

A. niger var. awamori PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT FPLNSTLYAD

A. niger NRRL3135 PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSSPAT FPLNSTLYAD

A. fumigatus 13073 PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD

A. fumigatus 32722 PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD

A. fumigatus 58128 PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD

A. fumigatus 26906 PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD

A. fumigatus 32239 PLGPAQGIGF tNELIARLTN S.PVQDHTST NsTLDSDPAT FPLNATIYvD

E. nidulans PLGPAQGIGF tNELIARLTQ S.PVQDNTST NHTLDSNPAT FPLDrkLYAD

T. thermophilus PLGPAQGVGF vNELIARMTH S.PVQDYTTv NHTLDSNPAT FPLNATLYAD

T. lanuginosus AFGPSRGVGF vNELIARMTg NlPVKDHTTv NHTLDdNPET FPLDAvLYAD

M. thermophila PLGPTQGVGF vNELLARLA. GvPVRDgTST NRTLDGDPrT FPLGrPLYAD

Basidio PLGPvQGVGY iNELLARLTx qa.VRDNTqT NRTLDSSPxT FPLNrTFYAD

Consensus PLGPAQGVGF -NELIARLTH S-PVQDHTST NHTLDSNPAT FPLNATLYAD

Fcp10 PLGPAQGVGF VNELIARLTH S.PVQDHTST NHTLDSNPAT FPLNATLYAD



351 400

A. terreus 9al FSHDSnLVSI FWALGLYNGT aPLSqTSVE. .SvsQTDGYA AAWTVPFAAR

A. terreus cbs FSHDSnLVSI FWALGLYNGT kPLSqTTVE. .ditrTDGYA AAWTVPFAAR

A. niger var. awamori FSHDNGIISI LFALGLYNGT kPLSTTTVE. .NitQTDGFS SAWTVPFASR

A. niger NRRL3135 FSHDNGIISI LFALGLYNGT kPLSTTTVE. .NitQTDGFS SAWTVPFASR

A. fumigatus 13073 FSHDNSMVSI FFALGLYNGT ePLSrTSVE. .SaKElDGYS ASWvVPFGAR

A. fumigatus 32722 FSHDNSMVSI FFALGLYNGT gPLSrTSVE. .SaKElDGYS ASWvVPFGAR

A. fumigatus 58128 FSHDNSMVSI FFALGLYNGT ePLSrTSVE. .SaKElDGYS ASWvVPFGAR

A. fumigatus 26906 FSHDNSMVSI FFALGLYNGT ePLSrTSVE. .SaKElDGYS ASWvVPFGAR

A. fumigatus 32239 FSHDNGMIPI FFAMGLYNGT ePLSqTSeE. .StKESNGYS ASWAVPFGAR

E. nidulans FSHDNSMISI FFAMGLYNGT qPLSmdSVE. .SiQEmDGYA ASWTVPFGAR

T. thermophilus FSHDNTMtSI FaALGLYNGT akLSTTelK. .SiEETDGYS AAWTVPFGGR

T. lanuginosus FSHDNTMtGI FsAMGLYNGT kPLSTSklQP pTgAAADGYA ASWTVPFAAR

M. thermophila FSHDNdMMGV LgALGaYDGv pPLdkTA..R rdpEElGGYA ASWAVPFAAR

Basidio FSHDNqMVAI FsAMGLFNqS aPLdPSxpDP nrt Wv TSklVPFSAR

Consensus FSHDNTMVSI FFALGLYNGT -PLSTTSVEP -S-EETDGYA ASWTVPFAAR

Fcp10 FSHDNTMVSI FFALGLYNGT KPLSTTSVE. .SIEETDGYA ASWTVPFAAR

401 450 

A. terreus 9al AYVEMMQC. . ra EKEPL VRVLVNDRVM PLHGCPtDKL 

A. terreus cbs AYIEMMQC. .. ra EKQPL VRVLVNDRVM PLHGCAVDNL 

A. niger var. awamori lYVEMMQC. . Qa EQEPL VRVLVNDRW PLHGCPIDaL

A. niger NRRL3135 lYVEMMQC. . Qa EQEPL VRVLVNDRW PLHGCPVDaL

A. fumigatus 13073 AYfEtMQC. . Ks EKEPL VRaLINDRW PLHGCDVDKL

A. fumigatus 32722 AYfEtMQC. . Ks EKEPL VRaLINDRW PLHGCDVDKL

A. fumigatus 58128 AYfEtMQC. . Ks EKESL VRaLINDRW PLHGCDVDKL

A. fumigatus 26906 AYfEtMQC. . Ks EKEPL VRaLINDRW PLHGCDVDKL

A. fumigatus 32239 AYfEtMQC. . Ks EKEPL VRaLINDRW PLHGCAVDKL

E. nidulans AYfELMQC. . E KKEPL VRVLVNDRW PLHGCAVDKF

T. thermophilus AYIEMMQC. . Dd sDEPV VRVLVNDRW PLHGCEVDsL 

T. lanuginosus AYVELLRC . . Etetsseeee EG. . .EDEPF VRVLVNDRW PLHGCrVDRW 

M. thermophila iYVEkMRC. . sggggggggg EGrqeKDEeM VRVLVNDRVM TLkGCGaDEr

Basidio mvVErLxCxx xgtxxxxxxx xxxxxxxxxx VRVLVNDaVq PLEf CGgDxd 

Consensus AYVEMMQC-- E EG EKEPL VRVLVNDRW PLHGCGVDKL 

Fcp10 AYVEMMQC. EA EKEPL VRVLVNDRW PLHGCGVDKL



Fig. 4c 
451 482

A. terreus 9al GRCKrDAFVA GLSFAQAG.. GNWADCF

A. terreus cbs GRCKrDDFVE GLSFARAG.. GNWAECF

A. niger var. awamori GRCtrDsFVr GLSFARSG.. GDWAECsA

A. niger NRRL3135 GRCtrDsFVr GLSFARSG.. GDWAECFA

A. fumigatus 13073 GRCKlNDFVK GLSWARSG.. GNWGECFS

A. fumigatus 32722 GRCKlNDFVK GLSWARSG.. GNWGECFS

A. fumigatus 58128 GRCKlNDFVK GLSWARSG.. GNWGECFS

A. fumigatus 26906 GRCKlNDFVK GLSWARSG.. GNWGECFS

A. fumigatus 32239 GRCKlKDFVK GLSWARSG.. GNSEQSFS

E. nidulans GRCtlDDWVE GLNFARSG.. GNWKtCFTl

T. thermophilus GRCKrDDFVr GLSFARqG.. GNWEGCYAas e

T. lanuginosus GRCRrDEWIK GLTFARqG.. GHWDrCF

M. thermophila GmCtlErFIE SMAFARGN.. GKWDlCFA

Basidio GxCtlDAFVE SqxYAReDgq GDFEKCFAtp xx

Consensus GRCK-DDFVE GLSFARSG-- GNWEECFA--

Fcp10 GRCKRDDFVE GLSFARSG.. GNWEECFA..



Fig. 4d 
CP-1

Eco RI MGVFVVLLSIATLFGST 17

TATATGAATTCATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

1 + + + + + + 60 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHSCDTVDGG 37

CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG
61 + + + + + + 120

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3.10

YQCFPEISHLWGQYSPFFSL 57 

GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATTCTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTAAGAAGAGAA 

ADESAISPDVPKGCRVTFVQ 77 
TGGCTGACGAATCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTC 



181 + + + + + + 240

ACCGACTGCTTAGACGATAAAGAGGTCTGCAAGGTTTCCCGACATCTCAATGAAAGCAAG 

CP-4.10 

CP-5.10 

VLSRHGARYPTSSKSKKYSA 97 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGAAGTACTCTG 

241 + + + + + + 300 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCTTCATGAGAC 

LIEAIQKNATAFKGKYAFLK 117

CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT

CP-6 

CP-7.10

TYNYTLGADDLTPFGEQQMV 137 

AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGG 
361 + + + + + + 420

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACC 



NSGIKFYRRYKALARKIVPF 157

TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT

421 + + + + + + 480

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8.10 

CP-9.10 

VRASGSDRVIASAEKFIEGF 177 

TCGTTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 
481 + + + + + + 540 

AGCAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGANPHQASPVIN 197 

TCCAATCTGCTAAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTA 

541 + + + + + + 600

AGGTTAGACGATTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAAT 



Fig. 5a 
CP-10.10

CP-11.10

VIIPEGAGYNNTLDHGLCTA 217

ACGTTATTATTCCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTG
601 + + + + + + 660

TGCAATAATAAGGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGAC 

FEESELGDDVEANFTAVFAP 237 

CTTTCGAAGAATCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTTAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAG 

CP-12.10

PIRARLEAHLPGVNLTDEDV 257 

CACCTATTAGAGCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACG 
721 + + + + + + 780 

GTGGATAATCTCGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGC 

CP-13.10 

VNLMDMCPFDTVARTSDATQ 277 

TTGTTAACTTGATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTC 

781 + + + + + + 840 

AACAATTGAACTACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAG 

LSPFCDLFTHDEWIQYDYLQ 297 

AATTGTCTCCATTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGC 
841 + + + + + + 900

TTAACAGAGGTAAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACG 

CP-14.10 

CP-15.10

SLGKYYGYGAGNPLGPAQGV 317 

AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 

901 + + + + + + 960 

TTAGAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

GFVNELIARLTHSPVQDHTS 337
TTGGTTTCGTTAACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTT

961 + + + + + + 1020

AACCAAAGCAATTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAA 

CP-16.10 

CP-17.10 

TNHTLDSNPATFPLNATLYA 357 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG
1021 + + + + + + 1080

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNTMVSIFFALGLYNG 377

CTGACTTCTCTCACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACG

1081 + + + + + + 1140

GACTGAAGAGAGTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18.10 

CP-19.10 

TKPLSTTSVESIEETDGYAA 397

GTACTAAGCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACGCTG
1141 + + + + + + 1200

CATGATTCGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCGAATGCGAC 



Fig. 5b 
SWTVPFAARAYVEMMQCEAE 417
CTTCTTGGACTGTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTG 

1201 + + + + + + 1260 

GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACACTTCGAC 

CP-20.10

CP-21.10 

KEPLVRVLVNDRVVPLHGCG 437 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 
1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGLSFAR 457
GTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

CACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22.10

SGGNWEECFA* Eco RI 467 
GATCTGGTGGTAACTGGGAAGAATGTTTCGCTTAAGAATTCATATA 
1381 + + + + 1426 

CTAGACCACCATTGACCCTTCTTACAAAGCGAATTCTTAAGTATAT 



Fig. 5c 



1 50

P. involutus (phyA1) -FPipeseqR nWSPYSPYFP LAEykA.... pPaGCQInqV

P. involutus (phyA2) -FsipeseqR nWSPYSPYFP LAEykA.... pPaGCelnqV

T. pubescens -LDvtRDVqQ sWSmYSPYFP aAtyvA.... pPaSCQInqV

A. pediades -pffpPQIqD sWAaYTPYYP VqAyTP.... pPKDCKITqV

P. lycii -LPipAQnTs nWGPYdPFFP VEpyAA.... pPEGCtVTqV

A. terreus 9al KhsdCNSVDh GYQCfPELSH kWGlYAPYFS LqDESPFPlD VPEDCHITFV

A. terreus cbs NhsdCtSVDr GYQCfPELSH kWGlYAPYFS LqDESPFPlD VPDDCHITFV

A. niger var. awamori NqsTCDTVDq GYQCfSEtSH LWGQYAPFFS LANESAISPD VPaGCRVTFa

A. niger T213 NqsSCDTVDq GYQCfSEtSH LWGQYAPFFS LANESvISPD VPaGCRVTFa

A. niger NRRL3135 NqsSCDTVDq GYQCfSEtSH LWGQYAPFFS LANESvISPE VPaGCRVTFa

A. fumigatus ATCC13073 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPKDCRITLV

A. fumigatus ATCC32722 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPKDCRITLV

A. fumigatus ATCC58128 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPKDCRITLV

A. fumigatus ATCC26906 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK LPKDCRITLV

A. fumigatus ATCC32239 GSkACDTVEl GYQCsPGtSH LWGQYSPFFS LEDElSVSSD LPKDCRVTFV

E. nidulans QNHSCNTaDg GYQCfPNVSH VWGQYSPYFS IEQESAISeD VPhGCeVTFV

T. thermophilus DSHSCNTVEg GYQCrPEISH sWGQYSPFFS LADQSEISPD VPQNCKITFV

T. lanuginosus nvDIAR hWGQYSPFFS LAEvSEISPA VPKGCRVeFV

M. thermophila ESRPCDTpDl GFQCgTAISH FWGQYSPYFS VPsElDaS.. IPDDCeVTFa

Consensus Seq. 11 NSHSCDTVD- GYQC-PEISH LWGQYSPFFS LADESAISPD VPKGCRVTFV



51 100

P. involutus (phyA1) NIIqRHGARF PTSGaTtRik AgLtKLQgvq nftDAKFnFI KSFKYdLGns

P. involutus (phyA2) NIIqRHGARF PTSGaAtRik AgLsKLQsvq nftDPKFDFI KSFtYdLGTs

T. pubescens HIIqRHGARF PTSGaAKRiq TaVAKLKaaS nytDPlLAFV tnYtYSLGqD

A. pediades NIIqRHGARF PTSGaGtRiq AaVKKLQsak TytDPRLDFL tnYtYTLGhD

P. lycii NLIqRHGARW PTSGarsRqv AaVAKIQmar PftDPKYEFL NdFvYkFGvA

A. terreus 9al QVLARHGARs PThSKTKaYA AtlAalQKSA TaFpGKYAFL QSYNYSLDSE

A. terreus cbs QVLARHGARs PTdSKTKaYA AtlAalQKNA TaLpGKYAFL KSYNYSMGSE

A. niger var. awamori QVLSRHGARY PTeSKGKKYS ALIEelQQNv TtFDGKYAFL KTYNYSLGAD

A. niger T213 QVLSRHGARY PTeSKGKKYS ALIEelQQNv TtFDGKYAFL KTYNYSLGAD

A. niger NRRL3135 QVLSRHGARY PTdSKGKKYS ALIEelQQNA TtFDGKYAFL KTYNYSLGAD

A. fumigatus ATCC13073 QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD

A. fumigatus ATCC32722 QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD

A. fumigatus ATCC58128 QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD

A. fumigatus ATCC26906 QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL KTYNYTLGAD

A. fumigatus ATCC32239 QVLSRHGARY PTASKSKKYk kLVtalQKNA TeFKGKFAFL ETYNYTLGAD

E. nidulans QVLSRHGARY PTeSKSKaYS GLIEalQKNA TsFwGQYAFL ESYNYTLGAD

T. thermophilus QLLSRHGARY PTSSKTElYS qLIsRIQKtA TaYKGyYAFL KdYrYqLGAN

T. lanuginosus QVLSRHGARY PTAhKSEvYA ELLQRIQDtA TeFKGDFAFL RdYaYhLGAD

M. thermophila QVLSRHGARa PTlkRAasYv DLIDRIHhGA isYgPgYEFL RTYDYTLGAD

Consensus Seq. 11 QVLSRHGARY PTSSKSKKYS ALIERIQKNA T-FKGKYAFL KTYNYTLGAD



Fig. 6a 
P. involutus (phyA1)

P. involutus (phyA2)

T. pubescens

A. pediades

P. lycii

A. terreus 9a1

A. terreus cbs

A. niger var. awamori

A. niger T213

A. niger NRRL3135

A. fumigatus ATCC13073

A. fumigatus ATCC32722

A. fumigatus ATCC58128

A. fumigatus ATCC26906

A. fumigatus ATCC32239

E. nidulans

T. thermophilus

T. lanuginosus

M. thermophila

Consensus Seq. 11



101 150 
DLvPFGAaQs fDAGqEaFaR YskLvSKNnL PFIRAdGSDR WDSAtNWtA
DLvPFGAaQs fDAGLEvFaR YskLvSsDnL PFIRSdGSDR WDTAtNWtA
SLveLGAtQs sEAGqEaFtR YsSLvSaDeL PFVRASGSDR WATANNWtA
DLvPFGAlQs sQAGeEtFQR YsfLvSKEnL PFVRASSSNR WDSAtNWtE
DLlPFGANQs hQTGtDMYtR YsTLfEgGdV PFVRAAGdQR WDSStNWtA 
ELTPFGrNQL rDlGaQFYeR YNAL.TRHIn PFVRATDAsR VhESAEKFVE 
NLTPFGrNQL qDlGaQFYRR YDTL.TRHIn PFVRAADSsR VhESAEKFVE
DLTPFGEQEL VNSGIKFYQR YESL.TRNII PFIRSSGSsR VIASGEKFIE
DLTPFGEQEL VNSGIKFYQR YESL.TRNII PFIRSSGSsR VIASGEKFIE
DLTPFGEQEL VNSGIKFYQR YESL.TRNIV PFIRSSGSsR VIASGKKFIE
DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE
DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE
DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE
DLTAFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR VIASGEKFIE
DLTPFGEQQM VNSGIKFYQK YKAL.AgSW PFIRSSGSDR VIASGEKFIE
DLTiFGENQM VDSGaKFYRR YKnL.ARKnt PFIRASGSDR WASAEKFIN
DLTPFGENQM IQlGIKFYnH YKSL.ARNaV PFVRCSGSDR VIASGrlFIE

NLTRFGEEQM MESGrQFYHR YREq.AREIV PFVRAAGSAR VIASAEfFnr
ELTRtGQQQM VNSGIKFYRR YRAL.ARKsI PFVRTAGqDR WhSAENFtQ 

DLTPFGENQM VNSGIKFYRR YKAL-ARNIV PFVRASGSDR VIASAEKFIE 



P. involutus (phyA1)

P. involutus (phyA2)

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a1

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T. thermophilus 

T. lanuginosus 

M. thermophila 



151 

GFaSA . . 
GFaSA . . 
GFalA. . 
GFsAA. . 
GFgdA. . 



200 



. - shNtvqPk LNLILPQ . . T gNDTLEDNMC PAaGD 

..srNaiqPk LDLILPQ. .T gNDTLEDNMC PAaGE 

. .SsNsiTPV LSVIISE. .A gNDTLDDNMC PAaGD 

.-ShHvlNPI LfVILSE..S LNDTLDDAMC PnaGs 

..sgEtvlPt LQWLQE..E gNcTLcNNMC PnevD 

GFQTARqDDh hAnpHQPSPr VDValPEGSA YNNTLEHSLC TAFEs . . . ST 
G FQNARqGDP hAnpHQPSPr VDWI PEGTA YNNTLEHS I C TAFEA. . .ST 
GFQSTKLkDP rAqpgQSSPk IDWISEASS sNNTLDpGtC TvFED...Se 
GFQSTKLkDP rAqpgQSSPk IDWISEASS sNNTLDpGtC TvFED. . .Se 
GFQSTKLkDP rAqpgQSSPk IDWISEASS SNNTLDpGtC TvFED . . . Se 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC TkFEA. . . Sq 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC TkFEA... Sq 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC TkFEA. . .Sq 
GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC TkFEA... Sq 
GFQqANVADP gAt . NRAAPV ISVIIPESeT YNNTLDHSVC TnFEA. . .Se 
GFRkAQLhDh g.s.gQATPV VNVIIPEidG FNNTLDHStC vSFEN. . .de 
GFQSAKV1DP hSdkHDAPPt INVIIeEGPS YNNTLDtGsC PvFED . . . SS 

GFQdAKdrDP rSnkDQAePV INVIISEETG sNNTLDgltC PAaEE...AP 
GFHSA1LADR gStvRPTlPy dmWlPETAG aNNTLHNDLC TAFEEgpyST 
GFQSAKLADP -A- -HQ ASP V INVTIPEGSG YNNTLDHGLC TAPED ST



Fig. 6b 
P. involutus (phyA1)

P. involutus (phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a1

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128

A. fumigatus ATCC26906

A. fumigatus ATCC32239

E. nidulans

T. thermophilus

T. lanuginosus 

M. thermophila 



201 250 
.SDpqvnaWl AVafPSItAR LNAaaPSVNL TDtDaf NLVs LCAF1TVSK. 
.SDpqvDaWl AsafPSVtAQ LNAaaPGaNL TDADaf NLVs LCPFmTVSK. 
.SDpqvnQWl AqFAPPMtAR LNAg a PGaNL TDtDtyNLLt LCPFETVAt . 
.SDpqtGiWT SIYGTPIanR LNqqaPGaNI TAADVsNLIp LCAFETIvK. 
.GDESt.tWl GVFAPnltAR LNAaaPSaNL SDsDaLtLMD MCPFDTLSs. 
VGDDAvANFT AVFAPAIaqR LEAdLPGVQL StDDWNXiMA MCPFETVS1T 
VGDAAADNFT AVFAPAIakR LEAdLPGVQL SADDWNLMA MCPFETVS1T 
LADtvEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LAD t vEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LADtvEANFT AtFvPSIRqR LEndLSGVtL TDtEVtyLMD MCSFDTIStS 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCS FDTVART 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCS FDTVART 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD MCS FDTVART 
LGDEvAANFT ALFAPdIRAR aKkhLPGVtL TDEDWSLMD MCS FDTVART 
LGDEvEANFT ALFAPAI RAR IEkhLPGVQL TDDDWSLMD MCS FDTVART 
rADEiEANFT AIMGPPIRkR LEndLPGIKL TNENVIyLMD MCSFDTMART 
gGHDAQEKFA kqFAPAI 1 EK I KDhLPGVDL AvsDVpyLMD LCPFETLARn 

. DptqpAEFl qVFGPRVlkK ItkhMPGVNL TlEDVpl FMD LCPFDTVGsd 
IGDDAQDtYl StFAGPItAR VNAnLPGaNL TDADtVaLMD LCPFETVAsS 
LGDDAEANFT AVFAPPIRAR LEA-LPGVNL TDEDVVNLMD MCPFDTVART



251 300 

P. involutus (phyA1) ekkSdF CtLFegiPGs FeaFAYggdL dKFYGtGyGQ

P. involutus (phyA2) eqkSdF CtLFegiPGs FeaFAYagdL dKFYGtGyGQ

T. pubescens errSeF CDIYeelqAE .daFAYnadL dKFYGtGyGQ 

A. pediades etpSPF CNLF. .TPEE FaQFEYFgdL dKFYGtGyGQ 

P. lycii gnaSPF CDLF. .TAEE YvsYEYYydL dKYYGtGPGN 

A. terreus 9a1 dD. .Aht LSPF CDLF. .TAtE WtQYNYLlSL dKYYGYGGGN

A. terreus cbs dD. .Aht LSPF CDLF. .TAAE WtQYNYLlSL dKYYGYGGGN 

A. niger var. awamori Tv. .DTK LSPF CDLF. .ThDE WiHYDYLQSL kKYYGHGAGN 

A. niger T213 Tv. .DTK LSPF CDLF. . ThDE WiHYDYLRSL kKYYGHGAGN 

A. niger NRRL3135 Tv. .DTK LSPF CDLF. .ThDE WiNYDYLQSL kKYYGHGAGN

A. fumigatus ATCC13073 SD. .ASQ LSPF CQLF. .ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC32722 SD. .ASQ LSPF CQLF. .ThNE WkKYNYLQSL gKYYGYGAGN 

A. fumigatus ATCC58128 SD. .ASQ LSPF CQLF. .ThNE WkKYNYLQSL gKYYGYGAGN

A. fumigatus ATCC26906 SD. .ASQ LSPF CQLF.. ThNE WkKYNYLQSL gKYYGYGAGN

A. fumigatus ATCC32239 AD. . ASE LSPF CAIF . . ThNE WkKYDYLQSL gKYYGYGAGN

E. nidulans AH. .GTE LSPF CAIF..TEKE W1QYDYLQSL sKYYGYGAGS

T. thermophilus ht . . DT . . . . LSPF CALs . . TqEE WqaYDYYQSL g KYYGnGGGN 

T. lanuginosus PvlfPrQ LSPF CHLF. .TADD WmaYDYYyTL dKYYSHGGGS 

M. thermophila SsdpATadag ggngrpLSPF CrLF..SEsE WraYDYLQSV gKWYGYGPGN 
SD--ATQ LSPF CDLF- -TADE W-QYDYLQSL -KYYGYGAGN



Fig. 6c 
P. involutus (phyA1)

P. involutus (phyA2)

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a1

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128

A. fumigatus ATCC26906

A. fumigatus ATCC32239

E. nidulans

T. thermophilus

T. lanuginosus

M. thermophila



301 350 
eLGPvQGVGY vNELIARLTN S.AVRDNTqT NRTLDASPvT FPLNkTFYAD
ALGPvQGVGY iNELLARLTN S.AVNDNTqT NRTLDAaPDT FPLNkTMYAD
PLGPvQGVGY iNELIARLTa q.nVsDHTqT NsTLDSSPET FPLNrTLYAD
PLGPvQGVGY iNELLARLTE m.PVRDNTqT NRTLDSSPlT FPLDrSIYAD
ALGPvQGVGY vNELLARLTg q.AVRDETqT NRTLDSDPAT FPLNrTFYAD
PLGPvQGVGW aNELMARLTR A.PVHDHTCv NNTLDASPAT FPLNATLYAD
PLGPvQGVGW aNELIARLTR S.PVHDHTCv NNTLDANPAT FPLNATLYAD
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT FPLNSTLYAD
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT FPLNSTLYAD
PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSSPAT FPLNSTLYAD
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTR S.PVQDHTST NsTLvSNPAT FPLNATMYvD 
PLGPAQGIGF tNELIARLTN S.PVQDHTST NsTLDSDPAT FPLNATIYvD
PLGPAQGIGF tNELIARLTQ S.PVQDNTST NHTLDSNPAT FPLDrkLYAD
PLGPAQGVGF vNELIARMTH S.PVQDYTTv NHTLDSNPAT FPLNATLYAD

AFGPSRGVGF vNELIARMTg NlPVKDHTTv NHTLDdNPET FPLDAvLYTUD
PLGPTQGVGF vNELLARLA. GvPVRDgTST NRTLDGDPrT FPLGrPLYAD
PLGPAQGVGF -NELIARLTH S-PVQDHTST NHTLDSNPAT FPLNATLYAD



P. involutus (phyA1)

P. involutus (phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a1

A. terreus cbs

A. niger var. awamori

A. niger T213 

A. niger NRRL3135 

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T. thermophilus

T. lanuginosus 

M. thermophila 



351 400 

FSHDN1MVAV FsAMGLFrqP aPLSTSvpNP wrt Wr TSS1VPFSGR 

FSHDN1MVAV FsAMGLFrqS aPLSTSTpDP nrt Wl TSSvVPFSAR 

FSHDNqMVAI FsAMGLFNqS aPLdPTTpDP art Fl vkkiVPFSAR 

LSHDNqMIAI FsAMGLFNqS sPLdPSfpNP krt Wv TSRltPFSAR 

FSHDNTMVP I FaALGL FNAT a.LdPlkpDe nrl Wv DSklVPFSGH 

FSHDSnLVSI FWALGLYNGT aPLSqTSVES Vs. . QTDGYA AAWTVP FAAR 

FSHDSnLVSI FWALGLYNGT KPLSqTTVEd It.. rTDG YA AAWTVPFAAR 

FSHDNGIISI LFALGLYNGT KPLSTTTVEN It . . QTDGFS SAWTVPFASR 

FSHDNGIISI LFALGLYNGT KPLSTTTVEN It.. QTDGFS SAWTVPFASR 

FSHDNGIISI LFALGLYNGT KPLSTTTVEN It.. QTDGFS SAWTVPFASR 

FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak. .E1DGYS ASWvVPFGAR 

FSHDNSMVSI FFALGLYNGT gPLSrTSVES ak. .ElDGYS ASWvVPFGAR 

FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak .. ElDGYS ASWvVPFGAR 

FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak .. ElDGYS ASWvVPFGAR 

FSHDNGMIPI FFAMGLYNGT EPLSqTSeES tk. . ESNGYS ASWAVPFGAR 

FSHDNSMISI FFAMGLYNGT QPLSmdSVES Iq . . EmDGYA ASWTVPFGAR 

FSHDNTMtSI FaALGLYNGT akLSTTelKS I e . . ETDGYS AAWTVPFGGR 

FSHDNTMtGI FsAMGLYNGT KPLSTSklQP ptgaAADGYA ASWTVPFAAR 

FSHDNdMMGV LgALGaYDGv pPLdkTArrd . . peElGGYA ASWAVPFAAR 
FSHDNTMVSI FFALGLYNGT KPLSTTSVES I ETDGYA ASWTVPFAAR



Fig. 6d 
P. involutus (phyA1)

P. involutus (phyA2) 

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a1

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128 

A. fumigatus ATCC26906

A. fumigatus ATCC32239 

E. nidulans 

T. thermophilus 

T. lanuginosus 

M. thermophila 



401 450 

mvVErLsC . . f Gt Tk VRVLVQDQVq PLEf CGgDRn 

maVErLsC. AGt Tk VRVLVQDQVq PLEf CGgDQd 

mvVErLDC. GGa Qs VRLLVNDaVq PLafCGaDts 

mvtErLlCQr DGtGsGGpsr imrNgnvQTF VRILVNDaLq PLkf CGgDmd 

mtVEkLaC egKea VRVLVNDaVq PLEfCGg.vd 

AYVEMMQCrA EK...EPL VRVLVNDRVM PLHGCPtDKL 

AYI EMMQCr A EK- . . QPL VRVLVNDRVM PLHGCAVDNL 

1YVEMMQCQA EQ...EPL VRVLVNDRW PLHGCPIDaL 

1YVEMMQCQA EQ...EPL VRVLVNDRW PLHGCPIDaL 

1YVEMMQCQA EQ...EPL VRVLVNDRW PLHGCPVDaL 

AYf EtMQCKS EK. . .EPL VRaLINDRW PLHGCDVDKL 

AYf EtMQCKS EK. . .EPL VRaLINDRW PLHGCDVDKL 

AYf EtMQCKS EK. . .ESL VRaLINDRW PLHGCDVDKL 

AYf EtMQCKS EK. . .EPL VRaLINDRW PLHGCDVDKL 

AYf EtMQCKS EK. . .EPL VRaLINDRW PLHGCAVDKL 

AYf ELMQCE KK...EPL VRVLVNDRW PLHGCAVDKF 

AYIEMMQCDD SD...EPV VRVLVNDRW PLHGCEVDsL 

AYVELLRCET ETsSeEEeEG ..ED... EPF VRVLVNDRW PLHGCrVDRW 
iYVEkMRCsG GGgGgGGgEG . . rQekdEeM VRVLVNDRVM TLkGCGaDEr 
AYVEMMQCEA GG-G-GG-EG - -EK EPL VRVLVNDRVV PLHGCGVDKL



P. involutus (phyA1)

P. involutus (phyA2)

T. pubescens 

A. pediades 

P. lycii 

A. terreus 9a1

A. terreus cbs

A. niger var. awamori

A. niger T213

A. niger NRRL3135

A. fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC58128

A. fumigatus ATCC26906

A. fumigatus ATCC32239

E. nidulans 

T. thermophilus 

T. lanuginosus 

M. thermophila



451 

GlCtLAKFVE SqTFARSDga 
GlCaLDKFVE S q AYARS Gg a 
GvCtLDAFVE S q A Y ARNDg e 
SlCtLEAFVE SqkYAReDgq 
GvCELsAFVE SqTYAReNgq 
GRCKrDAFVA GLSFAQAG . . 
GRCKrDDFVE GLSFARAG. . 
GRCtrDsFVr GLSFARSG. . 
GRCtrDsFVr GLSFARSG. . 
GRCtrDsFVr GLSFARSG. . 
GRCKLNDFVK GLSWARSG . . 
GRCKLNDFVK GLSWARSG . . 
GRCKLNDFVK GLSWARSG . . 
GRCKLNDFVK GLSWARSG.. 
GRCKLKD FVK GLSWARSG. . 
GRC t LDDWVE GLNFARSG. . 
GRCKrDDFVr GLSFARqG . . 

GRCRrDEW I K GLTFARqG. 
GmCtLErFIE SMAFARGN. . 



482 

GDFEKCFAts a- 

GDFEKCLAtt v- 

GDFEKCFAt- — 

GDFEKCFD — — 

GDFAKCgfvp se 

GNWADCF — 

GNWAECF — 

GDWAECsA — — 

GDWAECFA — — 

GDWAECFA — — 

GNWGECFS — — 

GNWGECFS — — 

GNWGECFS — — 

GNWGECFS — — 

GNSEQSFS — — 

GNWktCFTl- — 

GNWEGCYAas e- 

GHWDrCF ~< 

GKWD1CFA — — 
GRCKLDDFVE GLSFARSG-- GNWAECF A- - -



Fig. 6e 
MGVFVVLLSIATLFGSTSGT 20

ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 
1 + + + + + + 60

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

ALGPRGNSHSCDTVDGGYQC 40 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 
61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA

FPEISHLWGTYSPYFSLADE 60 
TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA

121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 

SAISPDVPDDCRVTFVQVLS 80 
TCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 
181 + + + + + + 240

AGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

RHGARYPTSSASKAYSALIE 100 
AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 

241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 

AIQKNATAFKGKYAFLKTYN 120 
GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

YTLGADDLTPFGENQMVNSG 140 
TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT 
361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 

IKFYRRYKALARKIVPFIRA 160
ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

SGSDRVIASAEKFIEGFQSA 180
TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 

KLADPGSQPHQASPVINVII 200
AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT 
541 + + + + + + 600 

TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 

PEGSGYNNTLDHGTCTAFED 220 
CCAGAAGGATCCGGTTACAACAACACTTTGGACCACGGTACTTGTACTGCTTTCGAAGAC
601 + + + + + + 660

GGTCTTCCTAGGCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGACGAAAGCTTCTG 

Fig. 7a 
SELGDDVEANFTALFAPAIR 240
TCTGAATTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 
661 + + + + + + 720 

AGACTTAATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 

ARLEADLPGVTLTDEDVVYL 260 
GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG
721 + + + + + + 780

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC

MDMCPFDTVARTSDATELSP 280 
ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 
781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 

FCALFTHDEWIQYDYLQSLG 300 

TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT
841 + + + + + + 900

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 

KYYGYGAGNPLGPAQGVGFA 320 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 
901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCGA 

NELIARLTHSPVQDHTSTNH 340 
AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

TLDSNPATFPLNATLYADFS 360 
ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

HDNTMISIFFALGLYNGTKP 380

CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 
1081 + + + + + + 1140

GTGCTGTTGTGATACTATAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT

LSTTSVESIEETDGYSASWT 400

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 1200

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

VPFAARAYVEMMQCQAEKEP 420 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTGAAAAGGAACCA 
1201 + + + + + + 1260

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT

LVRVLVNDRVVPLHGCAVDK 440 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 
1261 + + + + + + 1320

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 



Fig. 7b 
LGRCKRDDFVEGLSFARSGG 460
TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 



NWAECFA* 467 
AACTGGGCTGAATGTTTCGCTTAA 

1381 + + 1404

TTGACCCGACTTACAAAGCGAATT 



Fig. 7c 
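
The layout used in Figs. 7a to 7c (amino acid sequence, coding strand, position ruler, complementary strand) lends itself to a mechanical consistency check. The following is a minimal Python sketch, not part of the original disclosure. It assumes the standard genetic code and uses only the final 24-nucleotide fragment shown in Fig. 7c; the function names `translate` and `complement` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, reading frame starting at index 0."""
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def complement(dna: str) -> str:
    """Base-by-base complement, written in the same left-to-right order
    as the lower strand in the figures (not the reverse complement)."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


# Final fragment of Fig. 7c: upper (coding) and lower (complementary) strands.
coding = "AACTGGGCTGAATGTTTCGCTTAA"
lower = "TTGACCCGACTTACAAAGCGAATT"

print(translate(coding))             # amino acid line of the figure
print(complement(coding) == lower)   # strands should pair base by base
```

Applied line by line, the same two functions can flag positions where the printed amino acid line, the coding strand and the complementary strand disagree.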
MGVFVVLLSIATLFGSTSGT 20

ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 
1 + + + + + + 60

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

ALGPRGNSHSCDTVDGGYQC 40 

GCCTTGGGTCCTCGTGGTAACTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT
61 + + + + + + 120

CGGAACCCAGGAGCACCATTGAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

FPEISHLWGTYSPFFSLADE 60 

TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 
121 + + + + + + 180

AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 

SAISPDVPKGCRVTFVQVLS 80 

TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 
181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 

RHGARYPTSSASKAYSALIE 100 

AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCGTACTCTGCTTTGATTGAA 
241 + + + + + + 300

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGCATGAGACGAAACTAACTT 

AIQKNATAFKGKYAFLKTYN 120 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

YTLGADDLTPFGEQQMVNSG 140 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 
361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 

IKFYRRYKALARKIVPFIRA 160
ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

SGSDRVIASAEKFIEGFQSA 180 

TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 540

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 

KLADPGANPHQASPVINVII 200

AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT
541 + + + + + + 600

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA

PEGAGYNNTLDHGLCTAFEE 220 

CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 
601 + + + + + + 660

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 



Fig. 8a 
SELGDDVEANFTAVFAPPIR 240

TCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 
661 + + + + + + 720 

AGACTTAACCCACTGCTGGAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 

ARLEAHLPGVNLTDEDVVNL 260 

GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 
721 + + + + + + 780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 

MDMCPFDTVARTSDATQLSP 280
ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA
781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 

FCDLFTHDEWIQYDYLQSLG 300 
TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 
841 + + + + + + 900 

AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 

KYYGYGAGNPLGPAQGVGFV 320 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 
901 + + + + + + 960

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCAA

NELIARLTHSPVQDHTSTNH 340 
AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

TLDSNPATFPLNATLYADFS 360 
ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 

HDNTMVSIFFALGLYNGTKP 380

CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA
1081 + + + + + + 1140

GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGATTCGGT

LSTTSVESIEETDGYSASWT 400
TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

VPFAARAYVEMMQCEAEKEP 420 
GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 
1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 

LVRVLVNDRVVPLHGCGVDK 440 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGGTGTTGACAAG 
1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACCACAACTGTTC 
LGRCKRDDFVEGLSFARSGG 460
TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 

1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 



NWEECFA* 467 
AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 
MGVFVVLLSIATLFGSTSGT 20
ATGGGGGTTTTCGTCGTTCTATTATCTATCGCGACTCTGTTCGGCAGCACATCGGGCACT 

1 + + + + + + 60 

TACCCCCAAAAGCAGCAAGATAATAGATAGCGCTGAGACAAGCCGTCGTGTAGCCCGTGA

ALGPRGNHSKSCDTVDLGYQ 40 

GCGCTGGGCCCCCGTGGAAATCACTCCAAGTCCTGCGATACGGTAGACCTAGGGTACCAG 
61 + + + + + + 120

CGCGACCCGGGGGCACCTTTAGTGAGGTTCAGGACGCTATGCCATCTGGATCCCATGGTC 

CSPATSHLWGTYSPYFSLED 60

TGCTCCCCTGCGACTTCTCATCTATGGGGCACGTACTCGCCATaCTTTTCGCTCGAGGAC 
121 + + + + + + 180 

ACGAGGGGACGCTGAAGAGTAGATACCCCGtgCATGAGCGGTAtGAAAAGCGAGCTCCTG 

ELSVSSKLPKDCRITLVQVL 80 

GAGCTGTCCGTGTCGAGTAAGCTTCCCAAGGATTGCCGGATCACCTTGGTACAGGTGCTA 
181 + + + + + + 240 

CTCGACAGGCACAGCTCATTCGAAGGGTTCCTAACGGCCTAGTGGAACCATGTCCACGAT 

SRHGARYPTSSKSKKYKKLI 100 
TCGCGCCATGGAGCGCGGTACCCAACCAGCTCCAAGAGCAAAAAGTATAAGAAGCTTaTt 
241 + + + + + + 300 

AGCGCGGTACCTCGCGCCATGGGTTGGTCGAGGTTCTCGTTTTTCATATTCTTCGAAtAa 

TAIQANATDFKGKYAFLKTY 120 
ACGGCGATCCAGGCCAATGCCACCGACTTCAAGGGCAAGTacGCCTTTTTGAAGACGTAC 
301 + + + + + + 360 

TGCCGCTAGGTCCGGTTACGGTGGCTGAAGTTCCCGTTCAtgCGGAAAAACTTCTGCATG 

NYTLGADDLTPFGEQQLVNS 140 

AACTATACTCTGGGTGCGGATGACCTCACTCCCTTTGGGGAGCAGCAGCTGGTGAACTCG 
361 + + + + + + 42Q 

TTGATATGAGACCCACGCCTACTGGAGTGAGGGAAACCCCTCGTCGTCGACCACTTGAGC 

GIKFYQRYKALARSVVPFIR 160

GGCATCAAGTTCTACCAGAGGTACAAGGCTCTGGCGCGCAGTGTGGTGCCGTTTATTCGC 
421 + + + + + + 480

CCGTAGTTCAAGATGGTCTCCATGTTCCGAGACCGCGCGTCACACCACGGCAAATAAGCG

ASGSDRVIASGEKFIEGFQQ 180 

GCCTCAGGCTCGGACCGGGTTATTGCTTCGGGAGAGAAGTTCATCGAGGGGTTCCAGCAG 
481 + + + + + + 540 

CGGAGTCCGAGCCTGGCCCAATAACGAAGCCCTCTCTTCAAGTAGCTCCCCAAGGTCGTC 

AKLADPGATNRAAPAISVII 200

GCGAAGCTGGCTGATCCTGGCGCGACGAACCGCGCCGCTCCGGCGATTAGTGTGATTATT 
541 + + + + + + 600

CGCTTCGACCGACTAGGACCGCGCTGCTTGGCGCGGCGAGGCCGCTAATCACACTAATAA 

PESETFNNTLDHGVCTKFEA 220 

CCGGAGAGCGAGACGTTCAACAATACGCTGGACCACGGTGTGTGCACGAAGTTTGAGGCG
601 + + + + + + 660

GGCCTCTCGCTCTGCAAGTTGTTATGCGACCTGGTGCCACACACGTGCTTCAAACTCCGC 
SQLGDEVAANFTALFAPDIR 240
AGTCAGCTGGGAGATGAGGTTGCGGCCAATTTCACTGCGCTCTTTGCACCCGACATCCGA 

661 + + + + + + 720 

TCAGTCGACCCTCTACTCCAACGCCGGTTAAAGTGACGCGAGAAACGTGGGCTGTAGGCT 

ARLEKHLPGVTLTDEDVVSL 260 
GCTCGCctCGAGAAGCATCTTCCTGGCGTGACGCTGACAGACGAGGACGTTGTCAGTCTA 

721 + + + + + + 780 

CGAGCGgaGCTCTTCGTAGAAGGACCGCACTGCGACTGTCTGCTCCTGCAACAGTCAGAT 

MDMCPFDTVARTSDASQLSP 280 
ATGGACATGTGTcCGTTTGATACGGTAGCGCGGACCAGCGACGCAAGTCAGCTGTCACCG

781 + + + + + + 840 

TACCTGTACACAgGCAAACTATGCCATCGCGCGTGGTCGCTGCGTTCAGTCGACAGTGGC 

FCQLFTHNEWKKYDYLQSLG 300
TTCTGTCAACTCTTCACTCACAATGAGTGGAAGAAGTACgACTACCTTCAGTCCTTGGGC 

841 + + + + + + 900 

AAGACAGTTGAGAAGTGAGTGTTACTCACCTTCTTCATGcTGATGGAAGTCAGGAACCCG 

KYYGYGAGNPLGPAQGIGFT 320 
AAGTACTACGGCTACGGCGCAGGCAACCCTCTGGGACCGGCTCAGGGGATAGGGTTCACC 

901 + + + + + + 960 

TTCATGATGCCGATGCCGCGTCCGTTGGGAGACCCTGGCCGAGTCCCCTATCCCAAGTGG 

NELIARLTRSPVQDHTSTNS 340 
AACGAGCTGATTGCCCGGTTGACgCGTTCGCCAGTGCAGGACCACACCAGCACTAACTCG 

961 + + + + + + 1020 

TTGCTCGACTAACGGGCCAACTGcGCAAGCGGTCACGTCCTGGTGTGGTCGTGATTGAGC 

TLVSNPATFPLNATMYVDFS 360
ACTCTAGTCTCCAACCCGGCCACCTTCCCGTTGAACGCTACCATGTACGTCGACTTTTCA 

1021 + + + + + + 1080 

TGAGATCAGAGGTTGGGCCGGTGGAAGGGCAACTTGCGATGGTACATGCAGCTGAAAAGT 

HDNSMVSIFFALGLYNGTEP 380
CACGACAACAGCATGGTTTCCATCTTCTTTGCATTGGGCCTGTACAACGGCACTGAACCC 

1081 + + + + + + 1140 

GTGCTGTTGTCGTACCAAAGGTAGAAGAAACGTAACCCGGACATGTTGCCGTGACTTGGG 

LSRTSVESAKELDGYSASWV 400 
TTGTCCCGGACCTCGGTGGAAAGCGCCAAGGAATTGGATGGGTATTCTGCATCCTGGGTG 

1141 + + + + + + 1200 

AACAGGGCCTGGAGCCACCTTTCGCGGTTCCTTAACCTACCCATAAGACGTAGGACCCAC 

VPFGARAYFETMQCKSEKEP 420 
GTGCCTTTCGGCGCGCGAGCCTACTTCGAGACGATGCAATGCAAGTCGGAAAAGGAGCCT 

1201 + + + + + + 1260 

CACGGAAAGCCGCGCGCTCGGATGAAGCTCTGCTACGTTACGTTCAGCCTTTTCCTCGGA 

LVRALINDRVVPLHGCDVDK 440 
CTTGTTCGCGCTTTGATTAATGACCGGGTTGTGCCACTGCATGGCTGCGATGTGGACAAG 

1261 + + + + + + 1320 

GAACAAGCGCGAAACTAATTACTGGCCCAACACGGTGACGTACCGACGCTACACCTGTTC 
LGRCKLNDFVKGLSWARSGG 460

CTGGGGCGATGCAAGCTGAATGACTTTGTCAAGGGATTGAGTTGGGCCAGATCTGGGGGC 
1321 + + + + + + 1380

GACCCCGCTACGTTCGACTTACTGAAACAGTTCCCTAACTCAACCCGGTCTAGACCCCCG 

NWGECFS* 467
AACTGGGGAGAGTGCTTTAGTTGA

1381 + + 1404 

TTGACCCCTCTCACGAAATCAACT 
CP-1

Eco RI MGVFVVLLSIATLFGST

TATATGAATTCATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA

1 + + + + + + 60

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHSCDTVDGG

CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 
61 + + + + + + 120

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3 

YQCFPEISHLWGQYSPYFSL
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

EDESAISPDVPDDCRVTFVQ
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 
181 + + + + + + 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4.7 

CP-5.7 

VLSRHGARYPTDSKGKKYSA
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTgacTCTAAGggtAAGaagTACTCTG

241 + + + + + + 300

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGActgAGATTCccaTTCttcATGAGAC

LIEAIQKNATAFKGKYAFLK 

CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

361 + + + + + + 420

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGIKFYRRYKALARKIVPF 

TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8.7 

CP-9

IRASGSSRVIASAEKFIEGF 

TCATTAGAGCTTCTGGTTCTtctAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 
481 + + + + + + 540

AGTAATCTCGAAGACCAAGAagaTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

QSAKLADPGSQPHQASPVID

TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + + + + 600

AGGTTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 
CP-10.7

CP-11.7 



VIISEASSYNNTLDPGTCTA
ACGTTATTATTtCtGAcgctTCTtctTACAACAACACTTTGGACccaGGTACTTGTACTG
601 + + + + + + 660

TGCAATAATAAagaCTgcgaAGGagaATGTTGTTGTGAAACCTGggtCCATGAACATGAC 

FEDSELADTVEANFTALFAP 

CTTTCGAAGACTCTGAATTGgctGACactGTTGAAGCTAACTTCACTGCTTTGTTCGCTC

661 + + + + + + 720

GAAAGCTTCTGAGACTTAACcgaCTGtgaCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP-12.7 

AIRARLEADLPGVTLTDTEV 

CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACactgaaG
721 + + + + + + 780

GTCGATAATCTCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGtgacttC

CP-13.7 

TYLMDMCSFETVARTSDATE 
TTactTACTTGATGGACATGTGTtctTTCGAAACTGTTGCTAGAACTTCTGACGCTACTG

781 + + + + + + 840 

AAtgaATGAACTACCTGTACACAagaAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRHYDYLQ 

AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGAcacTACGACTACTTGC 
841 + + + + + + 900

TTAACAGAGGTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTgtgATGCTGATGAACG 

CP-14.7

CP-15.7

SLKKYYGHGAGNPLGPTQGV 

AATCTTTGaagAAGTACTACGGTcacGGTGCTGGTAACCCATTGGGTCCAactCAAGGTG 

901 + + + + + + 960

TTAGAAACttcTTCATGATGCCAgtgCCACGACCATTGGGTAACCCAGGTtgaGTTCCAC

GFANELIARLTRSPVQDHTS 
TTGGTTTCGCTAACGAATTGATTGCTAGATTGACTAGATCTCCAGTTCAAGACCACACTT 
961 + + + + + + 1020

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 

CP-16 

CP-17.7 

TNHTLDSNPATFPLNATLYA 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 
1021 + + + + + + 1080

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNGIISIFFALGLYNG 
CTGACTTCTCTCACGACAACggtattATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG

1081 + + + + + + 1140

GACTGAAGAGAGTGCTGTTGccataaTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC

CP-18.7 

CP-19.7 

TAPLSTTSVESIEETDGYSS 

GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTt 

1141 + + + + + + 1200

CATGACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAa 
AWTVPFASRAYVEMMQCQAE
ctgctTGGACTGTTCCATTCgcttctAGAGCTTACGTTGAAATGATGCAATGTCAAGCTG

1201 + + + + + + 1260

gacgaACCTGACAAGGTAAGcgaagaTCTCGAATGCAACTTTACTACGTTACAGTTCGAC

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 

1261 + + + + + + 1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 



VDKLGRCKRDDFVEGLSFAR
CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 1380 

GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* Eco RI 
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 

1381 + + + + 1426 

CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 
1 MGVFVVLLSI ATLFGSTSGT ALGPRGNSHS CDTVDGGYQC FPEISSNWSP
51 YSPYFSLADE SAISPDVPKG CRVTFVQVLQ RHGARFPTSG AATRISALIE
101 AIQKNATAFK GKYAFLKTYN YTLGADDLVP FGANQSSQAG IKFYRRYKAL
151 ARKIVPFIRA SGSDRVIDSA TNWIEGFQSA KLADPGANPH QASPVINVII
201 PEGAGYNNTL DHGLCTAFEE SELGDDVEAN FTAVFAPPIR ARLEAHLPGV
251 NLTDEDVVNL MDMCPFDTVA RTSDATELSP FCDLFTHDEW IQYDYLGDLD
301 KYYGTGAGNP LGPAQGVGFV NELIARLTHS PVQDHTSTNH TLDSNPATFP
351 LNATLYADFS HDNTMVAIFF ALGLYNGTKP LSTTSVESIE ETDGYSASWL
401 VPFSARMYVE MMQCEAEKEP LVRVLVNDRV VPLHGCGVDK LGRCKRDDFV
451 EGLSFARSGG NWEECFA
ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC
1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

MGVFVVLLSIATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT
61 + + + + + + 120

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA 
121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 

FPEISHLWGTYSPYFSLADE 

TCTGCTATTTCTCCAGACGTCCCAAAGGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 
181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAGGGTTTCCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKDCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTGCTTTGATTGAA 
241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGACGAAACTAACTT 

RHGARYPTSSKSKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 
AIQKNATAFKGKYAFLKTYN

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT
361 + + + + + + 420

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 

YTLGADDLTPFGENQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 480

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

IKFYRRYKALARKIVPFIRA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 
SGSDRVIASAEKFIEGFQSA 

Fig. 22a 
AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT
541 + + + + + + 600 

TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 
KLADPGSQPHQASPVINVII 

CCAGAAGGATCCGGTTACAACAACACTTTGGACCATGGTCTTTGTACTGCTTTCGAAGAC 
601 + + + + + + 660 

GGTCTTCCTAGGCCAATGTTGTTGTGAAACCTGGTACCAGAAACATGACGAAAGCTTCTG 

PEGSGYNNTLDHGLCTAFED 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 
661 + + + + + + 720

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 
STLGDDVEANFTALFAPAIR 

GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG
721 + + + + + + 780

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC

ARLEADLPGVTLTDEDVVYL 

ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 
781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 

MDMCPFDTVARTSDATELSP 

TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT 
841 + + + + + + 900 

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 
FCALFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 
901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCGA 

KYYGYGAGNPLGPAQGVGFA 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 
NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 
TLDSNPATFPLNATLYADFS 
CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 
1081 + + + + + + 1140 

GTGCTGTTGTGATACTATAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT 

HDNTMISIFFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 
LSTTSVESIEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTGAAAAGGAACCA 
1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT 
VPFAARAYVEMMQCQAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 
1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVNDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 
1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

LGRCKRDDFVEGLSFARSGG 

AACTGGGCTGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCGACTTACAAAGCGAATT 

NWAECFA* 



Fig. 22c 






ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 
1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 

MGVFVVLLSIATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 
61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA 
121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 
FPEISHLWGTYSPYFSLADE 

TCTGCTATTTCTCCAGACGTCCCAAAGGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 
181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAGGGTTTCCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKDCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 
241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 

RHGARYPTSSASKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 
AIQKNATAFKGKYAFLKTYN 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT 
361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 

YTLGADDLTPFGENQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 
IKFYRRYKALARKIVPFIRA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 
SGSDRVIASAEKFIEGFQSA 



Fig. 23a 
AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT 
541 + + + + + + 600 

TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 
KLADPGSQPHQASPVINVII 

CCAGAAGGATCCGGTTACAACAACACTTTGGACCATGGTCTTTGTACTGCTTTCGAAGAC 
601 + + + + + + 660 

GGTCTTCCTAGGCCAATGTTGTTGTGAAACCTGGTACCAGAAACATGACGAAAGCTTCTG 

PEGSGYNNTLDHGLCTAFED 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 
661 + + + + + + 720 

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 



STLGDDVEANFTALFAPAIR 



GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG 
721 + + + + + + 780 

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC 

ARLEADLPGVTLTDEDVVYL 

ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 
781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 
MDMCPFDTVARTSDATELSP 

TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT 
841 + + + + + + 900 

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 

FCALFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 
901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCGA 

KYYGYGAGNPLGPAQGVGFA 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 



TLDSNPATFPLNATLYADFS 



Fig. 23b 
CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 

1081 + + + + + + 1140 

GTGCTGTTGTGATACTATAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT 

HDNTMISIFFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 
LSTTSVESIEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTGAAAAGGAACCA 
1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT 

VPFAARAYVEMMQCQAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 
1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVNDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 
1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 
LGRCKRDDFVEGLSFARSGG 

AACTGGGCTGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCGACTTACAAAGCGAATT 

NWAECFA* 



Fig. 23c 






ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 
MGVFVVLLSIATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 
61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 
121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 

FPEISHLWGTYSPFFSLADE 

TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 
181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKGCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTGCTTTGATTGAA 
241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGACGAAACTAACTT 

RHGARYPTSSKSKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAT 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTA 

AIQKNATAFKGKYAFLKTYN 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 
361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 

YTLGADDLTPFGEQQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

IKFYRRYKALARKIVPFIRA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCCGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGGCTTTTCAAGTAACTTCCAAAGGTTAGACGA 
SGSDRVIASAEKFIEGFQSA 



Fig. 24a 
AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT 
541 + + + + + + 600 

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA 

KLADPGANPHQASPVINVII 

CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 
601 + + + + + + 660 

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 

PEGAGYNNTLDHGLCTAFEE 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 
661 + + + + + + 720 

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 
STLGDDVEANFTAVFAPPIR 

GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 
721 + + + + + + 780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 

ARLEAHLPGVNLTDEDVVNL 

ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA 
781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 
MDMCPFDTVARTSDATQLSP 

TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 
841 + + + + + + 900 

AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 

FCDLFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 
901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCAA 

KYYGYGAGNPLGPAQGVGFV 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 
NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 
TLDSNPATFPLNATLYADFS 



Fig. 24b 
CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA 
1081 + + + + + + 1140 

GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGATTCGGT 
HDNTMVSIFFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

LSTTSVESIEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 
1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 

VPFAARAYVEMMQCEAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 
1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVNDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 
1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

LGRCKRDDFVEGLSFARSGG 

AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 

NWEECFA* 



Fig. 24c 



ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 
1 + + + + + + 60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 
MGVFVVLLSIATLFGSTSGT 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 
61 + + + + + + 120 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

ALGPRGNSHSCDTVDGGYQC 

TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 
121 + + + + + + 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 

FPEISHLWGTYSPFFSLADE 

TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 
181 + + + + + + 240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 

SAISPDVPKGCRVTFVQVLS 

AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 
241 + + + + + + 300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 
RHGARYPTSSASKAYSALIE 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAT 
301 + + + + + + 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTA 

AIQKNATAFKGKYAFLKTYN 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 
361 + + + + + + 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 

YTLGADDLTPFGEQQMVNSG 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

IKFYRRYKALARKIVPFIRA 

TCTGGTTCTGACAGAGTTATTGCTTCTGCCGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 540 

AGACCAAGACTGTCTCAATAACGAAGACGGCTTTTCAAGTAACTTCCAAAGGTTAGACGA 
SGSDRVIASAEKFIEGFQSA 



Fig. 25a 
AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT 

541 + + + + + + 600 

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA 

KLADPGANPHQASPVINVII 

CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 

601 + + + + + + 660 

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 

PEGAGYNNTLDHGLCTAFEE 

TCTACCCTAGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 

661 + + + + + + 720 

AGATGGGATCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 

STLGDDVEANFTAVFAPPIR 

GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 
721 + + + + + + 780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 
ARLEAHLPGVNLTDEDVVNL 

ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA 

781 + + + + + + 840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 

MDMCPFDTVARTSDATQLSP 

TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 
841 + + + + + + 900 

AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 

FCDLFTHDEWIQYDYLQSLG 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 
901 + + + + + + 960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCAA 

KYYGYGAGNPLGPAQGVGFV 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 
NELIARLTHSPVQDHTSTNH 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 
TLDSNPATFPLNATLYADFS 



Fig. 25b 
CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA 

1081 + + + + + + 1140 

GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGATTCGGT 

HDNTMVSIFFALGLYNGTKP 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

LSTTSVESIEETDGYSASWT 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 
1201 + + + + + + 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 

VPFAARAYVEMMQCEAEKEP 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 
1261 + + + + + + 1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 

LVRVLVNDRVVPLHGCAVDK 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 
1321 + + + + + + 1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 
LGRCKRDDFVEGLSFARSGG 

AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 

NWEECFA* 



Fig. 25c 
SEQUENCE LISTING 

<110> Novo Nordisk A/S 

<120> Improved phytases 

<130> seqlistl71299 

<140> 
<141> 

<150> DK 99/00092 
<151> 1999-01-22 

<150> DK 99/01340 
<151> 1999-09-21 

<160> 89 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 440 
<212> PRT 

<213> Aspergillus terreus 9A-1 
<400> 1 

Lys His Ser Asp Cys Asn Ser Val Asp His Gly Tyr Gln Cys Phe Pro 
1 5 10 15 

Glu Leu Ser His Lys Trp Gly Leu Tyr Ala Pro Tyr Phe Ser Leu Gln 
20 25 30 

Asp Glu Ser Pro Phe Pro Leu Asp Val Pro Glu Asp Cys His Ile Thr 
35 40 45 

Phe Val Gln Val Leu Ala Arg His Gly Ala Arg Ser Pro Thr His Ser 
50 55 60 

Lys Thr Lys Ala Tyr Ala Ala Thr Ile Ala Ala Ile Gln Lys Ser Ala 
65 70 75 80 

Thr Ala Phe Pro Gly Lys Tyr Ala Phe Leu Gln Ser Tyr Asn Tyr Ser 
85 90 95 

Leu Asp Ser Glu Glu Leu Thr Pro Phe Gly Arg Asn Gln Leu Arg Asp 
100 105 110 

Leu Gly Ala Gln Phe Tyr Glu Arg Tyr Asn Ala Leu Thr Arg His Ile 
115 120 125 

Asn Pro Phe Val Arg Ala Thr Asp Ala Ser Arg Val His Glu Ser Ala 
130 135 140 

Glu Lys Phe Val Glu Gly Phe Gln Thr Ala Arg Gln Asp Asp His His 
145 150 155 160 

Ala Asn Pro His Gln Pro Ser Pro Arg Val Asp Val Ala Ile Pro Glu 
165 170 175 

Gly Ser Ala Tyr Asn Asn Thr Leu Glu His Ser Leu Cys Thr Ala Phe 
180 185 190 

Glu Ser Ser Thr Val Gly Asp Asp Ala Val Ala Asn Phe Thr Ala Val 
195 200 205 

Phe Ala Pro Ala Ile Ala Gln Arg Leu Glu Ala Asp Leu Pro Gly Val 
210 215 220 

Gln Leu Ser Thr Asp Asp Val Val Asn Leu Met Ala Met Cys Pro Phe 
225 230 235 240 

Glu Thr Val Ser Leu Thr Asp Asp Ala His Thr Leu Ser Pro Phe Cys 
245 250 255 

Asp Leu Phe Thr Ala Thr Glu Trp Thr Gln Tyr Asn Tyr Leu Leu Ser 
260 265 270 

Leu Asp Lys Tyr Tyr Gly Tyr Gly Gly Gly Asn Pro Leu Gly Pro Val 
275 280 285 

Gln Gly Val Gly Trp Ala Asn Glu Leu Met Ala Arg Leu Thr Arg Ala 
290 295 300 

Pro Val His Asp His Thr Cys Val Asn Asn Thr Leu Asp Ala Ser Pro 
305 310 315 320 

Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp 
325 330 335 

Ser Asn Leu Val Ser Ile Phe Trp Ala Leu Gly Leu Tyr Asn Gly Thr 
340 345 350 

Ala Pro Leu Ser Gln Thr Ser Val Glu Ser Val Ser Gln Thr Asp Gly 
355 360 365 

Tyr Ala Ala Ala Trp Thr Val Pro Phe Ala Ala Arg Ala Tyr Val Glu 
370 375 380 

Met Met Gln Cys Arg Ala Glu Lys Glu Pro Leu Val Arg Val Leu Val 
385 390 395 400 

Asn Asp Arg Val Met Pro Leu His Gly Cys Pro Thr Asp Lys Leu Gly 
405 410 415 

Arg Cys Lys Arg Asp Ala Phe Val Ala Gly Leu Ser Phe Ala Gln Ala 
420 425 430 

Gly Gly Asn Trp Ala Asp Cys Phe 
435 440 



<210> 2 
<211> 440 
<212> PRT 

<213> Aspergillus terreus cbs 
<400> 2 

Asn His Ser Asp Cys Thr Ser Val Asp Arg Gly Tyr Gln Cys Phe Pro 
1 5 10 15 

Glu Leu Ser His Lys Trp Gly Leu Tyr Ala Pro Tyr Phe Ser Leu Gln 
20 25 30 

Asp Glu Ser Pro Phe Pro Leu Asp Val Pro Asp Asp Cys His Ile Thr 
35 40 45 

Phe Val Gln Val Leu Ala Arg His Gly Ala Arg Ser Pro Thr Asp Ser 
50 55 60 

Lys Thr Lys Ala Tyr Ala Ala Thr Ile Ala Ala Ile Gln Lys Asn Ala 
65 70 75 80 

Thr Ala Leu Pro Gly Lys Tyr Ala Phe Leu Lys Ser Tyr Asn Tyr Ser 
85 90 95 

Met Gly Ser Glu Asn Leu Thr Pro Phe Gly Arg Asn Gln Leu Gln Asp 
100 105 110 

Leu Gly Ala Gln Phe Tyr Arg Arg Tyr Asp Thr Leu Thr Arg His Ile 
115 120 125 

Asn Pro Phe Val Arg Ala Ala Asp Ser Ser Arg Val His Glu Ser Ala 
130 135 140 

Glu Lys Phe Val Glu Gly Phe Gln Asn Ala Arg Gln Gly Asp Pro His 
145 150 155 160 

Ala Asn Pro His Gln Pro Ser Pro Arg Val Asp Val Val Ile Pro Glu 
165 170 175 

Gly Thr Ala Tyr Asn Asn Thr Leu Glu His Ser Ile Cys Thr Ala Phe 
180 185 190 

Glu Ala Ser Thr Val Gly Asp Ala Ala Ala Asp Asn Phe Thr Ala Val 
195 200 205 

Phe Ala Pro Ala Ile Ala Lys Arg Leu Glu Ala Asp Leu Pro Gly Val 
210 215 220 

Gln Leu Ser Ala Asp Asp Val Val Asn Leu Met Ala Met Cys Pro Phe 
225 230 235 240 

Glu Thr Val Ser Leu Thr Asp Asp Ala His Thr Leu Ser Pro Phe Cys 
245 250 255 

Asp Leu Phe Thr Ala Ala Glu Trp Thr Gln Tyr Asn Tyr Leu Leu Ser 
260 265 270 

Leu Asp Lys Tyr Tyr Gly Tyr Gly Gly Gly Asn Pro Leu Gly Pro Val 
275 280 285 

Gln Gly Val Gly Trp Ala Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser 
290 295 300 

Pro Val His Asp His Thr Cys Val Asn Asn Thr Leu Asp Ala Asn Pro 
305 310 315 320 

Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp 
325 330 335 

Ser Asn Leu Val Ser Ile Phe Trp Ala Leu Gly Leu Tyr Asn Gly Thr 
340 345 350 

Lys Pro Leu Ser Gln Thr Thr Val Glu Asp Ile Thr Arg Thr Asp Gly 
355 360 365 

Tyr Ala Ala Ala Trp Thr Val Pro Phe Ala Ala Arg Ala Tyr Ile Glu 
370 375 380 

Met Met Gln Cys Arg Ala Glu Lys Gln Pro Leu Val Arg Val Leu Val 
385 390 395 400 

Asn Asp Arg Val Met Pro Leu His Gly Cys Ala Val Asp Asn Leu Gly 
405 410 415 

Arg Cys Lys Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ala 
420 425 430 

Gly Gly Asn Trp Ala Glu Cys Phe 
435 440 



<210> 3 
<211> 441 
<212> PRT 

<213> Aspergillus niger var. awamori 
<400> 3 

Asn Gln Ser Thr Cys Asp Thr Val Asp Gln Gly Tyr Gln Cys Phe Ser 
1 5 10 15 

Glu Thr Ser His Leu Trp Gly Gln Tyr Ala Pro Phe Phe Ser Leu Ala 
20 25 30 

Asn Glu Ser Ala Ile Ser Pro Asp Val Pro Ala Gly Cys Arg Val Thr 
35 40 45 

Phe Ala Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Glu Ser 
50 55 60 

Lys Gly Lys Lys Tyr Ser Ala Leu Ile Glu Glu Ile Gln Gln Asn Val 
65 70 75 80 

Thr Thr Phe Asp Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Ser 
85 90 95 

Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gln Glu Leu Val Asn 
100 105 110 

Ser Gly Ile Lys Phe Tyr Gln Arg Tyr Glu Ser Leu Thr Arg Asn Ile 
115 120 125 

Ile Pro Phe Ile Arg Ser Ser Gly Ser Ser Arg Val Ile Ala Ser Gly 
130 135 140 

Glu Lys Phe Ile Glu Gly Phe Gln Ser Thr Lys Leu Lys Asp Pro Arg 
145 150 155 160 

Ala Gln Pro Gly Gln Ser Ser Pro Lys Ile Asp Val Val Ile Ser Glu 
165 170 175 

Ala Ser Ser Ser Asn Asn Thr Leu Asp Pro Gly Thr Cys Thr Val Phe 
180 185 190 

Glu Asp Ser Glu Leu Ala Asp Thr Val Glu Ala Asn Phe Thr Ala Thr 
195 200 205 

Phe Ala Pro Ser Ile Arg Gln Arg Leu Glu Asn Asp Leu Ser Gly Val 
210 215 220 

Thr Leu Thr Asp Thr Glu Val Thr Tyr Leu Met Asp Met Cys Ser Phe 
225 230 235 240 

Asp Thr Ile Ser Thr Ser Thr Val Asp Thr Lys Leu Ser Pro Phe Cys 
245 250 255 

Asp Leu Phe Thr His Asp Glu Trp Ile His Tyr Asp Tyr Leu Gln Ser 
260 265 270 

Leu Lys Lys Tyr Tyr Gly His Gly Ala Gly Asn Pro Leu Gly Pro Thr 
275 280 285 

Gln Gly Val Gly Tyr Ala Asn Glu Leu Ile Ala Arg Leu Thr His Ser 
290 295 300 

Pro Val His Asp Asp Thr Ser Ser Asn His Thr Leu Asp Ser Asn Pro 
305 310 315 320 

Ala Thr Phe Pro Leu Asn Ser Thr Leu Tyr Ala Asp Phe Ser His Asp 
325 330 335 

Asn Gly Ile Ile Ser Ile Leu Phe Ala Leu Gly Leu Tyr Asn Gly Thr 
340 345 350 

Lys Pro Leu Ser Thr Thr Thr Val Glu Asn Ile Thr Gln Thr Asp Gly 
355 360 365 

Phe Ser Ser Ala Trp Thr Val Pro Phe Ala Ser Arg Leu Tyr Val Glu 
370 375 380 

Met Met Gln Cys Gln Ala Glu Gln Glu Pro Leu Val Arg Val Leu Val 
385 390 395 400 

Asn Asp Arg Val Val Pro Leu His Gly Cys Pro Ile Asp Ala Leu Gly 
405 410 415 

Arg Cys Thr Arg Asp Ser Phe Val Arg Gly Leu Ser Phe Ala Arg Ser 
420 425 430 

Gly Gly Asp Trp Ala Glu Cys Ser Ala 
435 440 



<210> 4 
<211> 441 
<212> PRT 

<213> Aspergillus niger T213 
<400> 4 

Asn Gln Ser Ser Cys Asp Thr Val Asp Gln Gly Tyr Gln Cys Phe Ser 
1 5 10 15 

Glu Thr Ser His Leu Trp Gly Gln Tyr Ala Pro Phe Phe Ser Leu Ala 
20 25 30 

Asn Glu Ser Val Ile Ser Pro Asp Val Pro Ala Gly Cys Arg Val Thr 
35 40 45 

Phe Ala Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Glu Ser 
50 55 60 

Lys Gly Lys Lys Tyr Ser Ala Leu Ile Glu Glu Ile Gln Gln Asn Val 
65 70 75 80 

Thr Thr Phe Asp Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Ser 
85 90 95 

Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gln Glu Leu Val Asn 
100 105 110 

Ser Gly Ile Lys Phe Tyr Gln Arg Tyr Glu Ser Leu Thr Arg Asn Ile 
115 120 125 

Ile Pro Phe Ile Arg Ser Ser Gly Ser Ser Arg Val Ile Ala Ser Gly 
130 135 140 

Glu Lys Phe Ile Glu Gly Phe Gln Ser Thr Lys Leu Lys Asp Pro Arg 
145 150 155 160 

Ala Gln Pro Gly Gln Ser Ser Pro Lys Ile Asp Val Val Ile Ser Glu 
165 170 175 

Ala Ser Ser Ser Asn Asn Thr Leu Asp Pro Gly Thr Cys Thr Val Phe 
180 185 190 

Glu Asp Ser Glu Leu Ala Asp Thr Val Glu Ala Asn Phe Thr Ala Thr 
195 200 205 

Phe Ala Pro Ser Ile Arg Gln Arg Leu Glu Asn Asp Leu Ser Gly Val 
210 215 220 

Thr Leu Thr Asp Thr Glu Val Thr Tyr Leu Met Asp Met Cys Ser Phe 
225 230 235 240 

Asp Thr Ile Ser Thr Ser Thr Val Asp Thr Lys Leu Ser Pro Phe Cys 
245 250 255 

Asp Leu Phe Thr His Asp Glu Trp Ile His Tyr Asp Tyr Leu Arg Ser 
260 265 270 

Leu Lys Lys Tyr Tyr Gly His Gly Ala Gly Asn Pro Leu Gly Pro Thr 
275 280 285 

Gln Gly Val Gly Tyr Ala Asn Glu Leu Ile Ala Arg Leu Thr His Ser 
290 295 300 

Pro Val His Asp Asp Thr Ser Ser Asn His Thr Leu Asp Ser Asn Pro 
305 310 315 320 

Ala Thr Phe Pro Leu Asn Ser Thr Leu Tyr Ala Asp Phe Ser His Asp 
325 330 335 

Asn Gly Ile Ile Ser Ile Leu Phe Ala Leu Gly Leu Tyr Asn Gly Thr 
340 345 350 

Lys Pro Leu Ser Thr Thr Thr Val Glu Asn Ile Thr Gln Thr Asp Gly 
355 360 365 

Phe Ser Ser Ala Trp Thr Val Pro Phe Ala Ser Arg Leu Tyr Val Glu 
370 375 380 

Met Met Gln Cys Gln Ala Glu Gln Glu Pro Leu Val Arg Val Leu Val 
385 390 395 400 

Asn Asp Arg Val Val Pro Leu His Gly Cys Pro Ile Asp Ala Leu Gly 
405 410 415 

Arg Cys Thr Arg Asp Ser Phe Val Arg Gly Leu Ser Phe Ala Arg Ser 
420 425 430 

Gly Gly Asp Trp Ala Glu Cys Phe Ala 
435 440 



<210> 5 
<211> 441 
<212> PRT 

<213> Aspergillus niger NRRL3135 



<400> 5 

Asn Gln Ser Ser Cys Asp Thr Val Asp Gln Gly Tyr Gln Cys Phe Ser 
1 5 10 15 

Glu Thr Ser His Leu Trp Gly Gln Tyr Ala Pro Phe Phe Ser Leu Ala 
20 25 30 

Asn Glu Ser Val Ile Ser Pro Glu Val Pro Ala Gly Cys Arg Val Thr 
35 40 45 

Phe Ala Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Asp Ser 
50 55 60 

Lys Gly Lys Lys Tyr Ser Ala Leu Ile Glu Glu Ile Gln Gln Asn Ala 
65 70 75 80 

Thr Thr Phe Asp Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Ser 
85 90 95 

Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gln Glu Leu Val Asn 
100 105 110 

Ser Gly Ile Lys Phe Tyr Gln Arg Tyr Glu Ser Leu Thr Arg Asn Ile 
115 120 125 

Val Pro Phe Ile Arg Ser Ser Gly Ser Ser Arg Val Ile Ala Ser Gly 
130 135 140 

Lys Lys Phe Ile Glu Gly Phe Gln Ser Thr Lys Leu Lys Asp Pro Arg 
145 150 155 160 

Ala Gln Pro Gly Gln Ser Ser Pro Lys Ile Asp Val Val Ile Ser Glu 
165 170 175 

Ala Ser Ser Ser Asn Asn Thr Leu Asp Pro Gly Thr Cys Thr Val Phe 
180 185 190 

Glu Asp Ser Glu Leu Ala Asp Thr Val Glu Ala Asn Phe Thr Ala Thr 
195 200 205 

Phe Val Pro Ser Ile Arg Gln Arg Leu Glu Asn Asp Leu Ser Gly Val 
210 215 220 

Thr Leu Thr Asp Thr Glu Val Thr Tyr Leu Met Asp Met Cys Ser Phe 
225 230 235 240 

Asp Thr Ile Ser Thr Ser Thr Val Asp Thr Lys Leu Ser Pro Phe Cys 
245 250 255 

Asp Leu Phe Thr His Asp Glu Trp Ile Asn Tyr Asp Tyr Leu Gln Ser 
260 265 270 

Leu Lys Lys Tyr Tyr Gly His Gly Ala Gly Asn Pro Leu Gly Pro Thr 
275 280 285 

Gln Gly Val Gly Tyr Ala Asn Glu Leu Ile Ala Arg Leu Thr His Ser 
290 295 300 

Pro Val His Asp Asp Thr Ser Ser Asn His Thr Leu Asp Ser Ser Pro 
305 310 315 320 

Ala Thr Phe Pro Leu Asn Ser Thr Leu Tyr Ala Asp Phe Ser His Asp 
325 330 335 

Asn Gly Ile Ile Ser Ile Leu Phe Ala Leu Gly Leu Tyr Asn Gly Thr 
340 345 350 

Lys Pro Leu Ser Thr Thr Thr Val Glu Asn Ile Thr Gln Thr Asp Gly 
355 360 365 

Phe Ser Ser Ala Trp Thr Val Pro Phe Ala Ser Arg Leu Tyr Val Glu 
370 375 380 

Met Met Gln Cys Gln Ala Glu Gln Glu Pro Leu Val Arg Val Leu Val 
385 390 395 400 

Asn Asp Arg Val Val Pro Leu His Gly Cys Pro Val Asp Ala Leu Gly 
405 410 415 

Arg Cys Thr Arg Asp Ser Phe Val Arg Gly Leu Ser Phe Ala Arg Ser 
420 425 430 

Gly Gly Asp Trp Ala Glu Cys Phe Ala 
435 440 



<210> 6 
<211> 440 
<212> PRT 

<213> Aspergillus fumigatus 13073 
<400> 6 

Gly Ser Lys Ser Cys Asp Thr Val Asp Leu Gly Tyr Gln Cys Ser Pro 
1 5 10 15 

Ala Thr Ser His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Glu 
20 25 30 

Asp Glu Leu Ser Val Ser Ser Lys Leu Pro Lys Asp Cys Arg Ile Thr 
35 40 45 

Leu Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser 
50 55 60 

Lys Ser Lys Lys Tyr Lys Lys Leu Val Thr Ala Ile Gln Ala Asn Ala 
65 70 75 80 

Thr Asp Phe Lys Gly Lys Phe Ala Phe Leu Lys Thr Tyr Asn Tyr Thr 
85 90 95 

Leu Gly Ala Asp Asp Leu Phe Gln Gln Ala Lys Leu Ala Asp Pro Gly 
100 105 110 

Ala Thr Asn Arg Ala Ala Pro Ala Ile Ser Val Ile Ile Pro Glu Ser 
115 120 125 

Glu Thr Phe Asn Asn Thr Leu Asp His Gly Val Cys Thr Lys Phe Glu 
130 135 140 

Ala Ser Gln Leu Thr Pro Phe Gly Glu Gln Gln Leu Val Asn Ser Gly 
145 150 155 160 

Ile Lys Phe Tyr Gln Arg Tyr Lys Ala Leu Ala Arg Ser Val Val Pro 
165 170 175 

Phe Ile Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Gly Glu Lys 
180 185 190 

Phe Ile Glu Gly Gly Asp Glu Val Ala Ala Asn Phe Thr Ala Leu Phe 
195 200 205 

Ala Pro Asp Ile Arg Ala Arg Ala Glu Lys His Leu Pro Gly Val Thr 
210 215 220 

Leu Thr Asp Glu Asp Val Val Ser Leu Met Asp Met Cys Ser Phe Asp 
225 230 235 240 

Thr Val Ala Arg Thr Ser Asp Ala Ser Gln Leu Ser Pro Phe Cys Gln 
245 250 255 

Leu Phe Thr His Asn Glu Trp Lys Lys Tyr Asn Tyr Leu Gln Ser Leu 
260 265 270 

Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln 
275 280 285 

Gly Ile Gly Phe Thr Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro 
290 295 300 

Val Gln Asp His Thr Ser Thr Asn Ser Thr Leu Val Ser Asn Pro Ala 
305 310 315 320 

Thr Phe Pro Leu Asn Ala Thr Met Tyr Val Asp Phe Ser His Asp Asn 
325 330 335 

Ser Met Val Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Glu 
340 345 350 

Pro Leu Ser Arg Thr Ser Val Glu Ser Ala Lys Glu Leu Asp Gly Tyr 
355 360 365 

Ser Ala Ser Trp Val Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Thr 
370 375 380 

Met Gln Cys Lys Ser Glu Lys Glu Pro Leu Val Arg Ala Leu Ile Asn 
385 390 395 400 

Asp Arg Val Val Pro Leu His Gly Cys Asp Val Asp Lys Leu Gly Arg 
405 410 415 

Cys Lys Leu Asn Asp Phe Val Lys Gly Leu Ser Trp Ala Arg Ser Gly 
420 425 430 

Gly Asn Trp Gly Glu Cys Phe Ser 
435 440 



<210> 7 
<211> 440 
<212> PRT 

<213> Aspergillus fumigatus 32722 
<400> 7 

Gly Ser Lys Ser Cys Asp Thr Val Asp Leu Gly Tyr Gln Cys Ser Pro 
1 5 10 15 

Ala Thr Ser His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Glu 
20 25 30 

Asp Glu Leu Ser Val Ser Ser Lys Leu Pro Lys Asp Cys Arg Ile Thr 
35 40 45 

Leu Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser 
50 55 60 

Lys Ser Lys Lys Tyr Lys Lys Leu Val Thr Ala Ile Gln Ala Asn Ala 
65 70 75 80 

Thr Asp Phe Lys Gly Lys Phe Ala Phe Leu Lys Thr Tyr Asn Tyr Thr 
85 90 95 




Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gln Gln Leu Val Asn 
100 105 110 

Ser Gly Ile Lys Phe Tyr Gln Arg Tyr Lys Ala Leu Ala Arg Ser Val 
115 120 125 

Val Pro Phe Ile Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Gly 
130 135 140 

Glu Lys Phe Ile Glu Gly Phe Gln Gln Ala Lys Leu Ala Asp Pro Gly 
145 150 155 160 

Ala Thr Asn Arg Ala Ala Pro Ala Ile Ser Val Ile Ile Pro Glu Ser 
165 170 175 

Glu Thr Phe Asn Asn Thr Leu Asp His Gly Val Cys Thr Lys Phe Glu 
180 185 190 

Ala Ser Gln Leu Gly Asp Glu Val Ala Ala Asn Phe Thr Ala Leu Phe 
195 200 205 

Ala Pro Asp Ile Arg Ala Arg Ala Glu Lys His Leu Pro Gly Val Thr 
210 215 220 

Leu Thr Asp Glu Asp Val Val Ser Leu Met Asp Met Cys Ser Phe Asp 
225 230 235 240 

Thr Val Ala Arg Thr Ser Asp Ala Ser Gln Leu Ser Pro Phe Cys Gln 
245 250 255 

Leu Phe Thr His Asn Glu Trp Lys Lys Tyr Asn Tyr Leu Gln Ser Leu 
260 265 270 

Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln 
275 280 285 

Gly Ile Gly Phe Thr Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro 
290 295 300 

Val Gln Asp His Thr Ser Thr Asn Ser Thr Leu Val Ser Asn Pro Ala 
305 310 315 320 

Thr Phe Pro Leu Asn Ala Thr Met Tyr Val Asp Phe Ser His Asp Asn 
325 330 335 

Ser Met Val Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Gly 
340 345 350 

Pro Leu Ser Arg Thr Ser Val Glu Ser Ala Lys Glu Leu Asp Gly Tyr 
355 360 365 

Ser Ala Ser Trp Val Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Thr 
370 375 380 

Met Gln Cys Lys Ser Glu Lys Glu Pro Leu Val Arg Ala Leu Ile Asn 
385 390 395 400 

Asp Arg Val Val Pro Leu His Gly Cys Asp Val Asp Lys Leu Gly Arg 
405 410 415 

Cys Lys Leu Asn Asp Phe Val Lys Gly Leu Ser Trp Ala Arg Ser Gly 
420 425 430 

Gly Asn Trp Gly Glu Cys Phe Ser 
435 440 



<210> 8 
<211> 440 
<212> PRT 

<213> Aspergillus fumigatus 58128 
<400> 8 

Gly Ser Lys Ser Cys Asp Thr Val Asp Leu Gly Tyr Gln Cys Ser Pro 
1 5 10 15 

Ala Thr Ser His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Glu 
20 25 30 

Asp Glu Leu Ser Val Ser Ser Lys Leu Pro Lys Asp Cys Arg Ile Thr 
35 40 45 

Leu Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser 
50 55 60 

Lys Ser Lys Lys Tyr Lys Lys Leu Val Thr Ala Ile Gln Ala Asn Ala 
65 70 75 80 

Thr Asp Phe Lys Gly Lys Phe Ala Phe Leu Lys Thr Tyr Asn Tyr Thr 
85 90 95 

Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gln Gln Leu Val Asn 
100 105 110 

Ser Gly Ile Lys Phe Tyr Gln Arg Tyr Lys Ala Leu Ala Arg Ser Val 
115 120 125 

Val Pro Phe Ile Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Gly 
130 135 140 

Glu Lys Phe Ile Glu Gly Phe Gln Gln Ala Lys Leu Ala Asp Pro Gly 
145 150 155 160 

Ala Thr Asn Arg Ala Ala Pro Ala Ile Ser Val Ile Ile Pro Glu Ser 
165 170 175 

Glu Thr Phe Asn Asn Thr Leu Asp His Gly Val Cys Thr Lys Phe Glu 
180 185 190 

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Ala Ser Gin Leu Gly Asp Glu Val Ala Ala Asn Phe Thr Ala Leu Phe 
195 200 205 

Ala Pro Asp He Arg Ala Arg Ala Glu Lys His Leu Pro Gly Val Thr 
210 215 220 

Leu Thr Asp Glu Asp Val Val Ser Leu Met Asp Met Cys Ser Phe Asp 
225 230 235 240 

Thr Val Ala Arg Thr Ser Asp Ala Ser Gin Leu Ser Pro Phe Cys Gin 

245 250 255 

Leu Phe Thr His Asn Glu Trp Lys Lys Tyr Asn Tyr Leu Gin Ser Leu 

260 265 270 

Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gin 
275 280 285 

Gly He Gly Phe Thr Asn Glu Leu He Ala Arg Leu Thr Arg Ser Pro 
290 295 300 

Val Gin Asp His Thr Ser Thr Asn Ser Thr Leu Val Ser Asn Pro Ala 
305 310 315 320 

Thr Phe Pro Leu Asn Ala Thr Met Tyr Val Asp Phe Ser His Asp Asn 

325 330 335 

Ser Met Val Ser He Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Glu 

340 345 350 

Pro Leu Ser Arg Thr Ser Val Glu Ser Ala Lys Glu Leu Asp Gly Tyr 
355 360 365 

Ser Ala Ser Trp Val Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Thr 
370 375 380 

Met Gin Cys Lys Ser Glu Lys Glu Ser Leu Val Arg Ala Leu He Asn 
385 390 395 400 

Asp Arg Val Val Pro Leu His Gly Cys Asp Val Asp Lys Leu Gly Arg 

405 410 415 

Cys Lys Leu Asn Asp Phe Val Lys Gly Leu Ser Trp Ala Arg Ser Gly 

420 425 430 

Gly Asn Trp Gly Glu Cys Phe Ser 
435 440 



<210> 9 
<211> 440 
<212> PRT 

<213> Aspergillus fumigatus 26906 
<400> 9

Gly Ser Lys Ser Cys Asp Thr Val Asp Leu Gly Tyr Gln Cys Ser Pro
1 5 10 15

Ala Thr Ser His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Glu
20 25 30

Asp Glu Leu Ser Val Ser Ser Lys Leu Pro Lys Asp Cys Arg Ile Thr
35 40 45

Leu Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser
50 55 60

Lys Ser Lys Lys Tyr Lys Lys Leu Val Thr Ala Ile Gln Ala Asn Ala
65 70 75 80

Thr Asp Phe Lys Gly Lys Phe Ala Phe Leu Lys Thr Tyr Asn Tyr Thr
85 90 95

Leu Gly Ala Asp Asp Leu Thr Ala Phe Gly Glu Gln Gln Leu Val Asn
100 105 110

Ser Gly Ile Lys Phe Tyr Gln Arg Tyr Lys Ala Leu Ala Arg Ser Val
115 120 125

Val Pro Phe Ile Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Gly
130 135 140

Glu Lys Phe Ile Glu Gly Phe Gln Gln Ala Lys Leu Ala Asp Pro Gly
145 150 155 160

Ala Thr Asn Arg Ala Ala Pro Ala Ile Ser Val Ile Ile Pro Glu Ser
165 170 175

Glu Thr Phe Asn Asn Thr Leu Asp His Gly Val Cys Thr Lys Phe Glu
180 185 190

Ala Ser Gln Leu Gly Asp Glu Val Ala Ala Asn Phe Thr Ala Leu Phe
195 200 205

Ala Pro Asp Ile Arg Ala Arg Ala Lys Lys His Leu Pro Gly Val Thr
210 215 220

Leu Thr Asp Glu Asp Val Val Ser Leu Met Asp Met Cys Ser Phe Asp
225 230 235 240

Thr Val Ala Arg Thr Ser Asp Ala Ser Gln Leu Ser Pro Phe Cys Gln
245 250 255

Leu Phe Thr His Asn Glu Trp Lys Lys Tyr Asn Tyr Leu Gln Ser Leu
260 265 270

Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln
275 280 285

Gly Ile Gly Phe Thr Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro
290 295 300

Val Gln Asp His Thr Ser Thr Asn Ser Thr Leu Val Ser Asn Pro Ala
305 310 315 320

Thr Phe Pro Leu Asn Ala Thr Met Tyr Val Asp Phe Ser His Asp Asn
325 330 335

Ser Met Val Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Glu
340 345 350

Pro Leu Ser Arg Thr Ser Val Glu Ser Ala Lys Glu Leu Asp Gly Tyr
355 360 365

Ser Ala Ser Trp Val Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Thr
370 375 380

Met Gln Cys Lys Ser Glu Lys Glu Pro Leu Val Arg Ala Leu Ile Asn
385 390 395 400

Asp Arg Val Val Pro Leu His Gly Cys Asp Val Asp Lys Leu Gly Arg
405 410 415

Cys Lys Leu Asn Asp Phe Val Lys Gly Leu Ser Trp Ala Arg Ser Gly
420 425 430

Gly Asn Trp Gly Glu Cys Phe Ser
435 440



<210> 10 
<211> 440 
<212> PRT 

<213> Aspergillus fumigatus 32239 
<400> 10 

Gly Ser Lys Ala Cys Asp Thr Val Glu Leu Gly Tyr Gln Cys Ser Pro
1 5 10 15

Gly Thr Ser His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Glu
20 25 30

Asp Glu Leu Ser Val Ser Ser Asp Leu Pro Lys Asp Cys Arg Val Thr
35 40 45

Phe Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ala Ser
50 55 60

Lys Ser Lys Lys Tyr Lys Lys Leu Val Thr Ala Ile Gln Lys Asn Ala
65 70 75 80

Thr Glu Phe Lys Gly Lys Phe Ala Phe Leu Glu Thr Tyr Asn Tyr Thr
85 90 95

Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gln Gln Met Val Asn
100 105 110

Ser Gly Ile Lys Phe Tyr Gln Lys Tyr Lys Ala Leu Ala Gly Ser Val
115 120 125

Val Pro Phe Ile Arg Ser Ser Gly Ser Asp Arg Val Ile Ala Ser Gly
130 135 140

Glu Lys Phe Ile Glu Gly Phe Gln Gln Ala Asn Val Ala Asp Pro Gly
145 150 155 160

Ala Thr Asn Arg Ala Ala Pro Val Ile Ser Val Ile Ile Pro Glu Ser
165 170 175

Glu Thr Tyr Asn Asn Thr Leu Asp His Ser Val Cys Thr Asn Phe Glu
180 185 190

Ala Ser Glu Leu Gly Asp Glu Val Glu Ala Asn Phe Thr Ala Leu Phe
195 200 205

Ala Pro Ala Ile Arg Ala Arg Ile Glu Lys His Leu Pro Gly Val Gln
210 215 220

Leu Thr Asp Asp Asp Val Val Ser Leu Met Asp Met Cys Ser Phe Asp
225 230 235 240

Thr Val Ala Arg Thr Ala Asp Ala Ser Glu Leu Ser Pro Phe Cys Ala
245 250 255

Ile Phe Thr His Asn Glu Trp Lys Lys Tyr Asp Tyr Leu Gln Ser Leu
260 265 270

Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln
275 280 285

Gly Ile Gly Phe Thr Asn Glu Leu Ile Ala Arg Leu Thr Asn Ser Pro
290 295 300

Val Gln Asp His Thr Ser Thr Asn Ser Thr Leu Asp Ser Asp Pro Ala
305 310 315 320

Thr Phe Pro Leu Asn Ala Thr Ile Tyr Val Asp Phe Ser His Asp Asn
325 330 335

Gly Met Ile Pro Ile Phe Phe Ala Met Gly Leu Tyr Asn Gly Thr Glu
340 345 350

Pro Leu Ser Gln Thr Ser Glu Glu Ser Thr Lys Glu Ser Asn Gly Tyr
355 360 365

Ser Ala Ser Trp Ala Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Thr
370 375 380

Met Gln Cys Lys Ser Glu Lys Glu Pro Leu Val Arg Ala Leu Ile Asn
385 390 395 400

Asp Arg Val Val Pro Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg
405 410 415

Cys Lys Leu Lys Asp Phe Val Lys Gly Leu Ser Trp Ala Arg Ser Gly
420 425 430

Gly Asn Ser Glu Gln Ser Phe Ser
435 440



<210> 11 

<211> 439 

<212> PRT 

<213> Emericella nidulans 



<400> 11 

Gln Asn His Ser Cys Asn Thr Ala Asp Gly Gly Tyr Gln Cys Phe Pro
1 5 10 15

Asn Val Ser His Val Trp Gly Gln Tyr Ser Pro Tyr Phe Ser Ile Glu
20 25 30

Gln Glu Ser Ala Ile Ser Glu Asp Val Pro His Gly Cys Glu Val Thr
35 40 45

Phe Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Glu Ser
50 55 60

Lys Ser Lys Ala Tyr Ser Gly Leu Ile Glu Ala Ile Gln Lys Asn Ala
65 70 75 80

Thr Ser Phe Trp Gly Gln Tyr Ala Phe Leu Glu Ser Tyr Asn Tyr Thr
85 90 95

Leu Gly Ala Asp Asp Leu Thr Ile Phe Gly Glu Asn Gln Met Val Asp
100 105 110

Ser Gly Ala Lys Phe Tyr Arg Arg Tyr Lys Asn Leu Ala Arg Lys Asn
115 120 125

Thr Pro Phe Ile Arg Ala Ser Gly Ser Asp Arg Val Val Ala Ser Ala
130 135 140

Glu Lys Phe Ile Asn Gly Phe Arg Lys Ala Gln Leu His Asp His Gly
145 150 155 160

Ser Gly Gln Ala Thr Pro Val Val Asn Val Ile Ile Pro Glu Ile Asp
165 170 175

Gly Phe Asn Asn Thr Leu Asp His Ser Thr Cys Val Ser Phe Glu Asn
180 185 190

Asp Glu Arg Ala Asp Glu Ile Glu Ala Asn Phe Thr Ala Ile Met Gly
195 200 205

Pro Pro Ile Arg Lys Arg Leu Glu Asn Asp Leu Pro Gly Ile Lys Leu
210 215 220

Thr Asn Glu Asn Val Ile Tyr Leu Met Asp Met Cys Ser Phe Asp Thr
225 230 235 240

Met Ala Arg Thr Ala His Gly Thr Glu Leu Ser Pro Phe Cys Ala Ile
245 250 255

Phe Thr Glu Lys Glu Trp Leu Gln Tyr Asp Tyr Leu Gln Ser Leu Ser
260 265 270

Lys Tyr Tyr Gly Tyr Gly Ala Gly Ser Pro Leu Gly Pro Ala Gln Gly
275 280 285

Ile Gly Phe Thr Asn Glu Leu Ile Ala Arg Leu Thr Gln Ser Pro Val
290 295 300

Gln Asp Asn Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr
305 310 315 320

Phe Pro Leu Asp Arg Lys Leu Tyr Ala Asp Phe Ser His Asp Asn Ser
325 330 335

Met Ile Ser Ile Phe Phe Ala Met Gly Leu Tyr Asn Gly Thr Gln Pro
340 345 350

Leu Ser Met Asp Ser Val Glu Ser Ile Gln Glu Met Asp Gly Tyr Ala
355 360 365

Ala Ser Trp Thr Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Leu Met
370 375 380

Gln Cys Glu Lys Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg
385 390 395 400

Val Val Pro Leu His Gly Cys Ala Val Asp Lys Phe Gly Arg Cys Thr
405 410 415

Leu Asp Asp Trp Val Glu Gly Leu Asn Phe Ala Arg Ser Gly Gly Asn
420 425 430

Trp Lys Thr Cys Phe Thr Leu
435



<210> 12
<211> 443 
<212> PRT 

<213> Talaromyces thermophilus 
<400> 12 

Asp Ser His Ser Cys Asn Thr Val Glu Gly Gly Tyr Gln Cys Arg Pro
1 5 10 15

Glu Ile Ser His Ser Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Ala
20 25 30

Asp Gln Ser Glu Ile Ser Pro Asp Val Pro Gln Asn Cys Lys Ile Thr
35 40 45

Phe Val Gln Leu Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser
50 55 60

Lys Thr Glu Leu Tyr Ser Gln Leu Ile Ser Arg Ile Gln Lys Thr Ala
65 70 75 80

Thr Ala Tyr Lys Gly Tyr Tyr Ala Phe Leu Lys Asp Tyr Arg Tyr Gln
85 90 95

Leu Gly Ala Asn Asp Leu Thr Pro Phe Gly Glu Asn Gln Met Ile Gln
100 105 110

Leu Gly Ile Lys Phe Tyr Asn His Tyr Lys Ser Leu Ala Arg Asn Ala
115 120 125

Val Pro Phe Val Arg Cys Ser Gly Ser Asp Arg Val Ile Ala Ser Gly
130 135 140

Arg Leu Phe Ile Glu Gly Phe Gln Ser Ala Lys Val Leu Asp Pro His
145 150 155 160

Ser Asp Lys His Asp Ala Pro Pro Thr Ile Asn Val Ile Ile Glu Glu
165 170 175

Gly Pro Ser Tyr Asn Asn Thr Leu Asp Thr Gly Ser Cys Pro Val Phe
180 185 190

Glu Asp Ser Ser Gly Gly His Asp Ala Gln Glu Lys Phe Ala Lys Gln
195 200 205

Phe Ala Pro Ala Ile Leu Glu Lys Ile Lys Asp His Leu Pro Gly Val
210 215 220

Asp Leu Ala Val Ser Asp Val Pro Tyr Leu Met Asp Leu Cys Pro Phe
225 230 235 240

Glu Thr Leu Ala Arg Asn His Thr Asp Thr Leu Ser Pro Phe Cys Ala
245 250 255

Leu Ser Thr Gln Glu Glu Trp Gln Ala Tyr Asp Tyr Tyr Gln Ser Leu
260 265 270

Gly Lys Tyr Tyr Gly Asn Gly Gly Gly Asn Pro Leu Gly Pro Ala Gln
275 280 285

Gly Val Gly Phe Val Asn Glu Leu Ile Ala Arg Met Thr His Ser Pro
290 295 300

Val Gln Asp Tyr Thr Thr Val Asn His Thr Leu Asp Ser Asn Pro Ala
305 310 315 320

Thr Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn
325 330 335

Thr Met Thr Ser Ile Phe Ala Ala Leu Gly Leu Tyr Asn Gly Thr Ala
340 345 350

Lys Leu Ser Thr Thr Glu Ile Lys Ser Ile Glu Glu Thr Asp Gly Tyr
355 360 365

Ser Ala Ala Trp Thr Val Pro Phe Gly Gly Arg Ala Tyr Ile Glu Met
370 375 380

Met Gln Cys Asp Asp Ser Asp Glu Pro Val Val Arg Val Leu Val Asn
385 390 395 400

Asp Arg Val Val Pro Leu His Gly Cys Glu Val Asp Ser Leu Gly Arg
405 410 415

Cys Lys Arg Asp Asp Phe Val Arg Gly Leu Ser Phe Ala Arg Gln Gly
420 425 430

Gly Asn Trp Glu Gly Cys Tyr Ala Ala Ser Glu
435 440



<210> 13 
<211> 466 
<212> PRT 

<213> Myceliophthora thermophila 
<400> 13 

Glu Ser Arg Pro Cys Asp Thr Pro Asp Leu Gly Phe Gln Cys Gly Thr
1 5 10 15

Ala Ile Ser His Phe Trp Gly Gln Tyr Ser Pro Tyr Phe Ser Val Pro
20 25 30

Ser Glu Leu Asp Ala Ser Ile Pro Asp Asp Cys Glu Val Thr Phe Ala
35 40 45

Gln Val Leu Ser Arg His Gly Ala Arg Ala Pro Thr Leu Lys Arg Ala
50 55 60

Ala Ser Tyr Val Asp Leu Ile Asp Arg Ile His His Gly Ala Ile Ser
65 70 75 80

Tyr Gly Pro Gly Tyr Glu Phe Leu Arg Thr Tyr Asp Tyr Thr Leu Gly
85 90 95

Ala Asp Glu Leu Thr Arg Thr Gly Gln Gln Gln Met Val Asn Ser Gly
100 105 110

Ile Lys Phe Tyr Arg Arg Tyr Arg Ala Leu Ala Arg Lys Ser Ile Pro
115 120 125

Phe Val Arg Thr Ala Gly Gln Asp Arg Val Val His Ser Ala Glu Asn
130 135 140

Phe Thr Gln Gly Phe His Ser Ala Leu Leu Ala Asp Arg Gly Ser Thr
145 150 155 160

Val Arg Pro Thr Leu Pro Tyr Asp Met Val Val Ile Pro Glu Thr Ala
165 170 175

Gly Ala Asn Asn Thr Leu His Asn Asp Leu Cys Thr Ala Phe Glu Glu
180 185 190

Gly Pro Tyr Ser Thr Ile Gly Asp Asp Ala Gln Asp Thr Tyr Leu Ser
195 200 205

Thr Phe Ala Gly Pro Ile Thr Ala Arg Val Asn Ala Asn Leu Pro Gly
210 215 220

Ala Asn Leu Thr Asp Ala Asp Thr Val Ala Leu Met Asp Leu Cys Pro
225 230 235 240

Phe Glu Thr Val Ala Ser Ser Ser Ser Asp Pro Ala Thr Ala Asp Ala
245 250 255

Gly Gly Gly Asn Gly Arg Pro Leu Ser Pro Phe Cys Arg Leu Phe Ser
260 265 270

Glu Ser Glu Trp Arg Ala Tyr Asp Tyr Leu Gln Ser Val Gly Lys Trp
275 280 285

Tyr Gly Tyr Gly Pro Gly Asn Pro Leu Gly Pro Thr Gln Gly Val Gly
290 295 300

Phe Val Asn Glu Leu Leu Ala Arg Leu Ala Gly Val Pro Val Arg Asp
305 310 315 320

Gly Thr Ser Thr Asn Arg Thr Leu Asp Gly Asp Pro Arg Thr Phe Pro
325 330 335

Leu Gly Arg Pro Leu Tyr Ala Asp Phe Ser His Asp Asn Asp Met Met
340 345 350

Gly Val Leu Gly Ala Leu Gly Ala Tyr Asp Gly Val Pro Pro Leu Asp
355 360 365

Lys Thr Ala Arg Arg Asp Pro Glu Glu Leu Gly Gly Tyr Ala Ala Ser
370 375 380

Trp Ala Val Pro Phe Ala Ala Arg Ile Tyr Val Glu Lys Met Arg Cys
385 390 395 400

Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Glu Gly Arg Gln Glu Lys
405 410 415

Asp Glu Glu Met Val Arg Val Leu Val Asn Asp Arg Val Met Thr Leu
420 425 430

Lys Gly Cys Gly Ala Asp Glu Arg Gly Met Cys Thr Leu Glu Arg Phe
435 440 445

Ile Glu Ser Met Ala Phe Ala Arg Gly Asn Gly Lys Trp Asp Leu Cys
450 455 460

Phe Ala
465



<210> 14 
<211> 441 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 

<400> 14 

Asn Ser His Ser Cys Asp Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro
1 5 10 15

Glu Ile Ser His Leu Trp Gly Gln Tyr Ser Pro Tyr Phe Ser Leu Glu
20 25 30

Asp Glu Ser Ala Ile Ser Pro Asp Val Pro Asp Asp Cys Arg Val Thr
35 40 45

Phe Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser
50 55 60

Lys Ser Lys Ala Tyr Ser Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala
65 70 75 80

Thr Ala Phe Lys Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr
85 90 95

Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Asn Gln Met Val Asn
100 105 110

Ser Gly Ile Lys Phe Tyr Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile
115 120 125

Val Pro Phe Ile Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Ala
130 135 140

Glu Lys Phe Ile Glu Gly Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly
145 150 155 160

Ser Gln Pro His Gln Ala Ser Pro Val Ile Asp Val Ile Ile Pro Glu
165 170 175

Gly Ser Gly Tyr Asn Asn Thr Leu Asp His Gly Thr Cys Thr Ala Phe
180 185 190

Glu Asp Ser Glu Leu Gly Asp Asp Val Glu Ala Asn Phe Thr Ala Leu
195 200 205

Phe Ala Pro Ala Ile Arg Ala Arg Leu Glu Ala Asp Leu Pro Gly Val
210 215 220

Thr Leu Thr Asp Glu Asp Val Val Tyr Leu Met Asp Met Cys Pro Phe
225 230 235 240

Glu Thr Val Ala Arg Thr Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys
245 250 255

Ala Leu Phe Thr His Asp Glu Trp Arg Gln Tyr Asp Tyr Leu Gln Ser
260 265 270

Leu Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala
275 280 285

Gln Gly Val Gly Phe Ala Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser
290 295 300

Pro Val Gln Asp His Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro
305 310 315 320

Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp
325 330 335

Asn Ser Met Ile Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr
340 345 350

Ala Pro Leu Ser Thr Thr Ser Val Glu Ser Ile Glu Glu Thr Asp Gly
355 360 365

Tyr Ser Ala Ser Trp Thr Val Pro Phe Gly Ala Arg Ala Tyr Val Glu
370 375 380

Met Met Gln Cys Gln Ala Glu Lys Glu Pro Leu Val Arg Val Leu Val
385 390 395 400

Asn Asp Arg Val Val Pro Leu His Gly Cys Ala Val Asp Lys Leu Gly
405 410 415

Arg Cys Lys Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ser
420 425 430

Gly Gly Asn Trp Ala Glu Cys Phe Ala
435 440



<210> 15 
<211> 1426 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 

<220> 

<221> CDS 

<222> (12)..(1412)

<220>

<221> sig_peptide
<222> (12)..(89)

<220>

<221> mat_peptide
<222> (90)..(1412)

<400> 15 

tatatgaatt c atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg 50
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu
-25 -20 -15

ttc ggt tcc aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac 98
Phe Gly Ser Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His
-10 -5 -1 1

tct tgt gac act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct 146
Ser Cys Asp Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser
5 10 15

cac ttg tgg ggt caa tac tct cca tac ttc tct ttg gaa gac gaa tct 194
His Leu Trp Gly Gln Tyr Ser Pro Tyr Phe Ser Leu Glu Asp Glu Ser
20 25 30 35

gct att tct cca gac gtt cca gac gac tgt aga gtt act ttc gtt caa 242
Ala Ile Ser Pro Asp Val Pro Asp Asp Cys Arg Val Thr Phe Val Gln
40 45 50

gtt ttg tct aga cac ggt gct aga tac cca act tct tct aag tct aag 290
Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys
55 60 65

gct tac tct gct ttg att gaa gct att caa aag aac gct act gct ttc 338
Ala Tyr Ser Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe
70 75 80

aag ggt aag tac gct ttc ttg aag act tac aac tac act ttg ggt gct 386
Lys Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala
85 90 95

gac gac ttg act cca ttc ggt gaa aac caa atg gtt aac tct ggt att 434
Asp Asp Leu Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile
100 105 110 115

aag ttc tac aga aga tac aag gct ttg gct aga aag att gtt cca ttc 482
Lys Phe Tyr Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe
120 125 130

att aga gct tct ggt tct gac aga gtt att gct tct gct gaa aag ttc 530
Ile Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe
135 140 145

att gaa ggt ttc caa tct gct aag ttg gct gac cca ggt tct caa cca 578
Ile Glu Gly Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro
150 155 160

cac caa gct tct cca gtt att gac gtt att att cca gaa gga tcc ggt 626
His Gln Ala Ser Pro Val Ile Asp Val Ile Ile Pro Glu Gly Ser Gly
165 170 175

tac aac aac act ttg gac cac ggt act tgt act gct ttc gaa gac tct 674
Tyr Asn Asn Thr Leu Asp His Gly Thr Cys Thr Ala Phe Glu Asp Ser
180 185 190 195

gaa ttg ggt gac gac gtt gaa gct aac ttc act gct ttg ttc gct cca 722
Glu Leu Gly Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro
200 205 210

gct att aga gct aga ttg gaa gct gac ttg cca ggt gtt act ttg act 770
Ala Ile Arg Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr
215 220 225

gac gaa gac gtt gtt tac ttg atg gac atg tgt cca ttc gaa act gtt 818
Asp Glu Asp Val Val Tyr Leu Met Asp Met Cys Pro Phe Glu Thr Val
230 235 240

gct aga act tct gac gct act gaa ttg tct cca ttc tgt gct ttg ttc 866
Ala Arg Thr Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe
245 250 255

act cac gac gaa tgg aga caa tac gac tac ttg caa tct ttg ggt aag 914
Thr His Asp Glu Trp Arg Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys
260 265 270 275

tac tac ggt tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt 962
Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val
280 285 290

ggt ttc gct aac gaa ttg att gct aga ttg act aga tct cca gtt caa 1010
Gly Phe Ala Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val Gln
295 300 305

gac cac act tct act aac cac act ttg gac tct aac cca gct act ttc 1058
Asp His Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe
310 315 320

cca ttg aac gct act ttg tac gct gac ttc tct cac gac aac tct atg 1106
Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Ser Met
325 330 335

att tct att ttc ttc gct ttg ggt ttg tac aac ggt act gct cca ttg 1154
Ile Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Ala Pro Leu
340 345 350 355

tct act act tct gtt gaa tct att gaa gaa act gac ggt tac tct gct 1202
Ser Thr Thr Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala
360 365 370

tct tgg act gtt cca ttc ggt gct aga gct tac gtt gaa atg atg caa 1250
Ser Trp Thr Val Pro Phe Gly Ala Arg Ala Tyr Val Glu Met Met Gln
375 380 385

tgt caa gct gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga 1298
Cys Gln Ala Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg
390 395 400

gtt gtt cca ttg cac ggt tgt gct gtt gac aag ttg ggt aga tgt aag 1346
Val Val Pro Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys
405 410 415

aga gac gac ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac 1394
Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn
420 425 430 435

tgg gct gaa tgt ttc gct taagaattca tata 1426
Trp Ala Glu Cys Phe Ala
440



<210> 16 
<211> 467 
<212> PRT 
<213> Artificial Sequence

<223> Description of Artificial Sequence: consensus 
phytase 



<400> 16 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-10 -5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20

Gly Gln Tyr Ser Pro Tyr Phe Ser Leu Glu Asp Glu Ser Ala Ile Ser
25 30 35

Pro Asp Val Pro Asp Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
40 45 50

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Ala Tyr Ser
55 60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100

Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
105 110 115

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
120 125 130

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
135 140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asp Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
170 175 180

Thr Leu Asp His Gly Thr Cys Thr Ala Phe Glu Asp Ser Glu Leu Gly
185 190 195

Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
200 205 210

Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
215 220 225 230

Val Val Tyr Leu Met Asp Met Cys Pro Phe Glu Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260

Glu Trp Arg Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Ala
280 285 290

Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val Gln Asp His Thr
295 300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Ser Met Ile Ser Ile
330 335 340

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Ala Pro Leu Ser Thr Thr
345 350 355

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
360 365 370

Val Pro Phe Gly Ala Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
375 380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
425 430 435

Cys Phe Ala
440



<210> 17 

<211> 422 

<212> PRT 

<213> Paxillus involutus phyA1



<400> 17 

Ser Val Pro Lys Asn Thr Ala Pro Thr Phe Pro Ile Pro Glu Ser Glu
1 5 10 15

Gln Arg Asn Trp Ser Pro Tyr Ser Pro Tyr Phe Pro Leu Ala Glu Tyr
20 25 30

Lys Ala Pro Pro Ala Gly Cys Gln Ile Asn Gln Val Asn Ile Ile Gln
35 40 45

Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Thr Thr Arg Ile Lys
50 55 60

Ala Gly Leu Thr Lys Leu Gln Gly Val Gln Asn Phe Thr Asp Ala Lys
65 70 75 80

Phe Asn Phe Ile Lys Ser Phe Lys Tyr Asp Leu Gly Asn Ser Asp Leu
85 90 95

Val Pro Phe Gly Ala Ala Gln Ser Phe Asp Ala Gly Gln Glu Ala Phe
100 105 110

Ala Arg Tyr Ser Lys Leu Val Ser Lys Asn Asn Leu Pro Phe Ile Arg
115 120 125

Ala Asp Gly Ser Asp Arg Val Val Asp Ser Ala Thr Asn Trp Thr Ala
130 135 140

Gly Phe Ala Ser Ala Ser His Asn Thr Val Gln Pro Lys Leu Asn Leu
145 150 155 160

Ile Leu Pro Gln Thr Gly Asn Asp Thr Leu Glu Asp Asn Met Cys Pro
165 170 175

Ala Ala Gly Asp Ser Asp Pro Gln Val Asn Ala Trp Leu Ala Val Ala
180 185 190

Phe Pro Ser Ile Thr Ala Arg Leu Asn Ala Ala Ala Pro Ser Val Asn
195 200 205

Leu Thr Asp Thr Asp Ala Phe Asn Leu Val Ser Leu Cys Ala Phe Leu
210 215 220

Thr Val Ser Lys Glu Lys Lys Ser Asp Phe Cys Thr Leu Phe Glu Gly
225 230 235 240

Ile Pro Gly Ser Phe Glu Ala Phe Ala Tyr Gly Gly Asp Leu Asp Lys
245 250 255

Phe Tyr Gly Thr Gly Tyr Gly Gln Glu Leu Gly Pro Val Gln Gly Val
260 265 270

Gly Tyr Val Asn Glu Leu Ile Ala Arg Leu Thr Asn Ser Ala Val Arg
275 280 285

Asp Asn Thr Gln Thr Asn Arg Thr Leu Asp Ala Ser Pro Val Thr Phe
290 295 300

Pro Leu Asn Lys Thr Phe Tyr Ala Asp Phe Ser His Asp Asn Leu Met
305 310 315 320

Val Ala Val Phe Ser Ala Met Gly Leu Phe Arg Gln Pro Ala Pro Leu
325 330 335

Ser Thr Ser Val Pro Asn Pro Trp Arg Thr Trp Arg Thr Ser Ser Leu
340 345 350

Val Pro Phe Ser Gly Arg Met Val Val Glu Arg Leu Ser Cys Phe Gly
355 360 365

Thr Thr Lys Val Arg Val Leu Val Gln Asp Gln Val Gln Pro Leu Glu
370 375 380

Phe Cys Gly Gly Asp Arg Asn Gly Leu Cys Thr Leu Ala Lys Phe Val
385 390 395 400

Glu Ser Gln Thr Phe Ala Arg Ser Asp Gly Ala Gly Asp Phe Glu Lys
405 410 415

Cys Phe Ala Thr Ser Ala
420



<210> 18 
<211> 422 
<212> PRT 

<213> Paxillus involutus phyA2
<400> 18 

Ser Val Pro Arg Asn Ile Ala Pro Lys Phe Ser Ile Pro Glu Ser Glu
1 5 10 15

Gln Arg Asn Trp Ser Pro Tyr Ser Pro Tyr Phe Pro Leu Ala Glu Tyr
20 25 30

Lys Ala Pro Pro Ala Gly Cys Glu Ile Asn Gln Val Asn Ile Ile Gln
35 40 45

Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Ala Thr Arg Ile Lys
50 55 60

Ala Gly Leu Ser Lys Leu Gln Ser Val Gln Asn Phe Thr Asp Pro Lys
65 70 75 80

Phe Asp Phe Ile Lys Ser Phe Thr Tyr Asp Leu Gly Thr Ser Asp Leu
85 90 95

Val Pro Phe Gly Ala Ala Gln Ser Phe Asp Ala Gly Leu Glu Val Phe
100 105 110

Ala Arg Tyr Ser Lys Leu Val Ser Ser Asp Asn Leu Pro Phe Ile Arg
115 120 125

Ser Asp Gly Ser Asp Arg Val Val Asp Thr Ala Thr Asn Trp Thr Ala
130 135 140

Gly Phe Ala Ser Ala Ser Arg Asn Ala Ile Gln Pro Lys Leu Asp Leu
145 150 155 160

Ile Leu Pro Gln Thr Gly Asn Asp Thr Leu Glu Asp Asn Met Cys Pro
165 170 175

Ala Ala Gly Glu Ser Asp Pro Gln Val Asp Ala Trp Leu Ala Ser Ala
180 185 190

Phe Pro Ser Val Thr Ala Gln Leu Asn Ala Ala Ala Pro Gly Ala Asn
195 200 205

Leu Thr Asp Ala Asp Ala Phe Asn Leu Val Ser Leu Cys Pro Phe Met
210 215 220

Thr Val Ser Lys Glu Gln Lys Ser Asp Phe Cys Thr Leu Phe Glu Gly
225 230 235 240

Ile Pro Gly Ser Phe Glu Ala Phe Ala Tyr Ala Gly Asp Leu Asp Lys
245 250 255

Phe Tyr Gly Thr Gly Tyr Gly Gln Ala Leu Gly Pro Val Gln Gly Val
260 265 270

Gly Tyr Ile Asn Glu Leu Leu Ala Arg Leu Thr Asn Ser Ala Val Asn
275 280 285

Asp Asn Thr Gln Thr Asn Arg Thr Leu Asp Ala Ala Pro Asp Thr Phe
290 295 300

Pro Leu Asn Lys Thr Met Tyr Ala Asp Phe Ser His Asp Asn Leu Met
305 310 315 320

Val Ala Val Phe Ser Ala Met Gly Leu Phe Arg Gln Ser Ala Pro Leu
325 330 335

Ser Thr Ser Thr Pro Asp Pro Asn Arg Thr Trp Leu Thr Ser Ser Val
340 345 350

Val Pro Phe Ser Ala Arg Met Ala Val Glu Arg Leu Ser Cys Ala Gly
355 360 365

Thr Thr Lys Val Arg Val Leu Val Gln Asp Gln Val Gln Pro Leu Glu
370 375 380

Phe Cys Gly Gly Asp Gln Asp Gly Leu Cys Ala Leu Asp Lys Phe Val
385 390 395 400

Glu Ser Gln Ala Tyr Ala Arg Ser Gly Gly Ala Gly Asp Phe Glu Lys
405 410 415

Cys Leu Ala Thr Thr Val
420



<210> 19 
<211> 420 
<212> PRT 

<213> Trametes pubescens 
<400> 19 

His Ile Pro Leu Arg Asp Thr Ser Ala Cys Leu Asp Val Thr Arg Asp
1 5 10 15

Val Gln Gln Ser Trp Ser Met Tyr Ser Pro Tyr Phe Pro Ala Ala Thr
20 25 30

Tyr Val Ala Pro Pro Ala Ser Cys Gln Ile Asn Gln Val His Ile Ile
35 40 45

Gln Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Ala Lys Arg Ile
50 55 60

Gln Thr Ala Val Ala Lys Leu Lys Ala Ala Ser Asn Tyr Thr Asp Pro
65 70 75 80

Leu Leu Ala Phe Val Thr Asn Tyr Thr Tyr Ser Leu Gly Gln Asp Ser
85 90 95

Leu Val Glu Leu Gly Ala Thr Gln Ser Ser Glu Ala Gly Gln Glu Ala
100 105 110

Phe Thr Arg Tyr Ser Ser Leu Val Ser Ala Asp Glu Leu Pro Phe Val
115 120 125

Arg Ala Ser Gly Ser Asp Arg Val Val Ala Thr Ala Asn Asn Trp Thr
130 135 140

Ala Gly Phe Ala Leu Ala Ser Ser Asn Ser Ile Thr Pro Val Leu Ser
145 150 155 160

Val Ile Ile Ser Glu Ala Gly Asn Asp Thr Leu Asp Asp Asn Met Cys
165 170 175

Pro Ala Ala Gly Asp Ser Asp Pro Gln Val Asn Gln Trp Leu Ala Gln
180 185 190

Phe Ala Pro Pro Met Thr Ala Arg Leu Asn Ala Gly Ala Pro Gly Ala
195 200 205

Asn Leu Thr Asp Thr Asp Thr Tyr Asn Leu Leu Thr Leu Cys Pro Phe
210 215 220

Glu Thr Val Ala Thr Glu Arg Arg Ser Glu Phe Cys Asp Ile Tyr Glu
225 230 235 240

Glu Leu Gln Ala Glu Asp Ala Phe Ala Tyr Asn Ala Asp Leu Asp Lys
245 250 255

Phe Tyr Gly Thr Gly Tyr Gly Gln Pro Leu Gly Pro Val Gln Gly Val
260 265 270

Gly Tyr Ile Asn Glu Leu Ile Ala Arg Leu Thr Ala Gln Asn Val Ser
275 280 285

Asp His Thr Gln Thr Asn Ser Thr Leu Asp Ser Ser Pro Glu Thr Phe
290 295 300

Pro Leu Asn Arg Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Gln Met
305 310 315 320

Val Ala Ile Phe Ser Ala Met Gly Leu Phe Asn Gln Ser Ala Pro Leu
325 330 335

Asp Pro Thr Thr Pro Asp Pro Ala Arg Thr Phe Leu Val Lys Lys Ile
340 345 350

Val Pro Phe Ser Ala Arg Met Val Val Glu Arg Leu Asp Cys Gly Gly
355 360 365

Ala Gln Ser Val Arg Leu Leu Val Asn Asp Ala Val Gln Pro Leu Ala
370 375 380

Phe Cys Gly Ala Asp Thr Ser Gly Val Cys Thr Leu Asp Ala Phe Val
385 390 395 400

Glu Ser Gln Ala Tyr Ala Arg Asn Asp Gly Glu Gly Asp Phe Glu Lys
405 410 415

Cys Phe Ala Thr
420



<210> 20 
<211> 435 
<212> PRT 

<213> Agrocybe pediades 
<400> 20 

Gly Gly Val Val Gln Ala Thr Phe Val Gln Pro Phe Phe Pro Pro Gln
1 5 10 15

Ile Gln Asp Ser Trp Ala Ala Tyr Thr Pro Tyr Tyr Pro Val Gln Ala
20 25 30

Tyr Thr Pro Pro Pro Lys Asp Cys Lys Ile Thr Gln Val Asn Ile Ile
35 40 45

Gln Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Gly Thr Arg Ile
50 55 60

Gln Ala Ala Val Lys Lys Leu Gln Ser Ala Lys Thr Tyr Thr Asp Pro
65 70 75 80

Arg Leu Asp Phe Leu Thr Asn Tyr Thr Tyr Thr Leu Gly His Asp Asp
85 90 95

Leu Val Pro Phe Gly Ala Leu Gln Ser Ser Gln Ala Gly Glu Glu Thr
100 105 110

Phe Gln Arg Tyr Ser Phe Leu Val Ser Lys Glu Asn Leu Pro Phe Val
115 120 125

Arg Ala Ser Ser Ser Asn Arg Val Val Asp Ser Ala Thr Asn Trp Thr
130 135 140

Glu Gly Phe Ser Ala Ala Ser His His Val Leu Asn Pro Ile Leu Phe
145 150 155 160

Val Ile Leu Ser Glu Ser Leu Asn Asp Thr Leu Asp Asp Ala Met Cys
165 170 175

Pro Asn Ala Gly Ser Ser Asp Pro Gln Thr Gly Ile Trp Thr Ser Ile
180 185 190

Tyr Gly Thr Pro Ile Ala Asn Arg Leu Asn Gln Gln Ala Pro Gly Ala
195 200 205

Asn Ile Thr Ala Ala Asp Val Ser Asn Leu Ile Pro Leu Cys Ala Phe
210 215 220

Glu Thr Ile Val Lys Glu Thr Pro Ser Pro Phe Cys Asn Leu Phe Thr
225 230 235 240

Pro Glu Glu Phe Ala Gln Phe Glu Tyr Phe Gly Asp Leu Asp Lys Phe
245 250 255

Tyr Gly Thr Gly Tyr Gly Gln Pro Leu Gly Pro Val Gln Gly Val Gly
260 265 270

Tyr Ile Asn Glu Leu Leu Ala Arg Leu Thr Glu Met Pro Val Arg Asp
275 280 285

Asn Thr Gln Thr Asn Arg Thr Leu Asp Ser Ser Pro Leu Thr Phe Pro
290 295 300

Leu Asp Arg Ser Ile Tyr Ala Asp Leu Ser His Asp Asn Gln Met Ile
305 310 315 320

Ala Ile Phe Ser Ala Met Gly Leu Phe Asn Gln Ser Ser Pro Leu Asp
325 330 335

Pro Ser Phe Pro Asn Pro Lys Arg Thr Trp Val Thr Ser Arg Leu Thr
340 345 350

Pro Phe Ser Ala Arg Met Val Thr Glu Arg Leu Leu Cys Gln Arg Asp
355 360 365

Gly Thr Gly Ser Gly Gly Pro Ser Arg Ile Met Arg Asn Gly Asn Val
370 375 380

Gln Thr Phe Val Arg Ile Leu Val Asn Asp Ala Leu Gln Pro Leu Lys
385 390 395 400

Phe Cys Gly Gly Asp Met Asp Ser Leu Cys Thr Leu Glu Ala Phe Val
405 410 415

Glu Ser Gln Lys Tyr Ala Arg Glu Asp Gly Gln Gly Asp Phe Glu Lys
420 425 430

Cys Phe Asp
435



<210> 21 
<211> 419 
<212> PRT 

<213> Peniophora lycii 



<400> 21 
Ser Thr Gln Phe Ser Phe Val Ala Ala Gln Leu Pro Ile Pro Ala Gln
1 5 10 15

Asn Thr Ser Asn Trp Gly Pro Tyr Asp Pro Phe Phe Pro Val Glu Pro
20 25 30

Tyr Ala Ala Pro Pro Glu Gly Cys Thr Val Thr Gln Val Asn Leu Ile
35 40 45

Gln Arg His Gly Ala Arg Trp Pro Thr Ser Gly Ala Arg Ser Arg Gln
50 55 60

Val Ala Ala Val Ala Lys Ile Gln Met Ala Arg Pro Phe Thr Asp Pro
65 70 75 80

Lys Tyr Glu Phe Leu Asn Asp Phe Val Tyr Lys Phe Gly Val Ala Asp
85 90 95

Leu Leu Pro Phe Gly Ala Asn Gln Ser His Gln Thr Gly Thr Asp Met
100 105 110

Tyr Thr Arg Tyr Ser Thr Leu Phe Glu Gly Gly Asp Val Pro Phe Val
115 120 125

Arg Ala Ala Gly Asp Gln Arg Val Val Asp Ser Ser Thr Asn Trp Thr
130 135 140

Ala Gly Phe Gly Asp Ala Ser Gly Glu Thr Val Leu Pro Thr Leu Gln
145 150 155 160

Val Val Leu Gln Glu Glu Gly Asn Cys Thr Leu Cys Asn Asn Met Cys
165 170 175

Pro Asn Glu Val Asp Gly Asp Glu Ser Thr Thr Trp Leu Gly Val Phe
180 185 190

Ala Pro Asn Ile Thr Ala Arg Leu Asn Ala Ala Ala Pro Ser Ala Asn
195 200 205

Leu Ser Asp Ser Asp Ala Leu Thr Leu Met Asp Met Cys Pro Phe Asp
210 215 220

Thr Leu Ser Ser Gly Asn Ala Ser Pro Phe Cys Asp Leu Phe Thr Ala
225 230 235 240

Glu Glu Tyr Val Ser Tyr Glu Tyr Tyr Tyr Asp Leu Asp Lys Tyr Tyr
245 250 255

Gly Thr Gly Pro Gly Asn Ala Leu Gly Pro Val Gln Gly Val Gly Tyr
260 265 270

Val Asn Glu Leu Leu Ala Arg Leu Thr Gly Gln Ala Val Arg Asp Glu
275 280 285

Thr Gln Thr Asn Arg Thr Leu Asp Ser Asp Pro Ala Thr Phe Pro Leu
290 295 300

Asn Arg Thr Phe Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Pro
305 310 315 320

Ile Phe Ala Ala Leu Gly Leu Phe Asn Ala Thr Ala Leu Asp Pro Leu
325 330 335

Lys Pro Asp Glu Asn Arg Leu Trp Val Asp Ser Lys Leu Val Pro Phe
340 345 350

Ser Gly His Met Thr Val Glu Lys Leu Ala Cys Ser Gly Lys Glu Ala
355 360 365

Val Arg Val Leu Val Asn Asp Ala Val Gln Pro Leu Glu Phe Cys Gly
370 375 380

Gly Val Asp Gly Val Cys Glu Leu Ser Ala Phe Val Glu Ser Gln Thr
385 390 395 400

Tyr Ala Arg Glu Asn Gly Gln Gly Asp Phe Ala Lys Cys Gly Phe Val
405 410 415

Pro Ser Glu



<210> 22
<211> 369
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: basidio
consensus

<400> 22 

Ser Pro Arg Thr Ala Ala Gln Leu Pro Ile Pro Gln Gln Trp Ser Pro
1 5 10 15

Tyr Ser Pro Tyr Phe Pro Val Ala Tyr Ala Pro Pro Ala Gly Cys Gln
20 25 30

Ile Gln Val Asn Ile Ile Gln Arg His Gly Ala Arg Phe Pro Thr Ser
35 40 45

Gly Ala Ala Thr Arg Ile Gln Ala Ala Val Ala Lys Leu Gln Ser Ala
50 55 60

Thr Asp Pro Lys Leu Asp Phe Leu Asn Thr Tyr Leu Gly Asp Asp Leu
65 70 75 80

Val Pro Phe Gly Ala Gln Ser Ser Gln Ala Gly Gln Glu Ala Phe Thr
85 90 95

Arg Tyr Ser Leu Val Ser Asp Asn Leu Pro Phe Val Arg Ala Ser Gly
100 105 110

Ser Asp Arg Val Val Asp Ser Ala Thr Asn Trp Thr Ala Gly Phe Ala
115 120 125

Ala Ser Asn Thr Pro Leu Val Ile Leu Ser Glu Gly Asn Asp Thr Leu
130 135 140

Asp Asp Asn Met Cys Pro Ala Gly Asp Ser Asp Pro Gln Asn Trp Leu
145 150 155 160

Ala Val Phe Ala Pro Pro Ile Thr Ala Arg Leu Asn Ala Ala Ala Pro
165 170 175

Gly Ala Asn Leu Thr Asp Asp Ala Asn Leu Leu Cys Pro Phe Glu Thr
180 185 190

Val Ser Glu Ser Phe Cys Asp Leu Phe Glu Pro Glu Glu Phe Ala Phe
195 200 205

Tyr Gly Asp Leu Asp Lys Phe Tyr Gly Thr Gly Tyr Gly Gln Pro Leu
210 215 220

Gly Pro Val Gln Gly Val Gly Tyr Ile Asn Glu Leu Leu Ala Arg Leu
225 230 235 240

Thr Gln Ala Val Arg Asp Asn Thr Gln Thr Asn Arg Thr Leu Asp Ser
245 250 255

Ser Pro Thr Phe Pro Leu Asn Arg Thr Phe Tyr Ala Asp Phe Ser His
260 265 270

Asp Asn Gln Met Val Ala Ile Phe Ser Ala Met Gly Leu Phe Asn Gln
275 280 285

Ser Ala Pro Leu Asp Pro Ser Pro Asp Pro Asn Arg Thr Trp Val Thr
290 295 300

Ser Lys Leu Val Pro Phe Ser Ala Arg Met Val Val Glu Arg Leu Cys
305 310 315 320

Gly Thr Val Arg Val Leu Val Asn Asp Ala Val Gln Pro Leu Glu Phe
325 330 335

Cys Gly Gly Asp Asp Gly Cys Thr Leu Asp Ala Phe Val Glu Ser Gln
340 345 350

Tyr Ala Arg Glu Asp Gly Gln Gly Asp Phe Glu Lys Cys Phe Ala Thr
355 360 365

Pro



<210> 23 
<211> 440 
<212> PRT 

<213> Thermomyces lanuginosus 
<400> 23 

Asn Val Asp Ile Ala Arg His Trp Gly Gln Tyr Ser Pro Phe Phe Ser
1 5 10 15

Leu Ala Glu Val Ser Glu Ile Ser Pro Ala Val Pro Lys Gly Cys Arg
20 25 30

Val Glu Phe Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr
35 40 45

Ala His Lys Ser Glu Val Tyr Ala Glu Leu Leu Gln Arg Ile Gln Asp
50 55 60

Thr Ala Thr Glu Phe Lys Gly Asp Phe Ala Phe Leu Arg Asp Tyr Ala
65 70 75 80

Tyr His Leu Gly Ala Asp Asn Leu Thr Arg Phe Gly Glu Glu Gln Met
85 90 95

Met Glu Ser Gly Arg Gln Phe Tyr His Arg Tyr Arg Glu Gln Ala Arg
100 105 110

Glu Ile Val Pro Phe Val Arg Ala Ala Gly Ser Ala Arg Val Ile Ala
115 120 125

Ser Ala Glu Phe Phe Asn Arg Gly Phe Gln Asp Ala Lys Asp Arg Asp
130 135 140

Pro Arg Ser Asn Lys Asp Gln Ala Glu Pro Val Ile Asn Val Ile Ile
145 150 155 160

Ser Glu Glu Thr Gly Ser Asn Asn Thr Leu Asp Gly Leu Thr Cys Pro
165 170 175

Ala Ala Glu Glu Ala Pro Asp Pro Thr Gln Pro Ala Glu Phe Leu Gln
180 185 190

Val Phe Gly Pro Arg Val Leu Lys Lys Ile Thr Lys His Met Pro Gly
195 200 205

Val Asn Leu Thr Leu Glu Asp Val Pro Leu Phe Met Asp Leu Cys Pro
210 215 220

Phe Asp Thr Val Gly Ser Asp Pro Val Leu Phe Pro Arg Gln Leu Ser
225 230 235 240

Pro Phe Cys His Leu Phe Thr Ala Asp Asp Trp Met Ala Tyr Asp Tyr
245 250 255

Tyr Tyr Thr Leu Asp Lys Tyr Tyr Ser His Gly Gly Gly Ser Ala Phe
260 265 270

Gly Pro Ser Arg Gly Val Gly Phe Val Asn Glu Leu Ile Ala Arg Met
275 280 285

Thr Gly Asn Leu Pro Val Lys Asp His Thr Thr Val Asn His Thr Leu
290 295 300

Asp Asp Asn Pro Glu Thr Phe Pro Leu Asp Ala Val Leu Tyr Ala Asp
305 310 315 320

Phe Ser His Asp Asn Thr Met Thr Gly Ile Phe Ser Ala Met Gly Leu
325 330 335

Tyr Asn Gly Thr Lys Pro Leu Ser Thr Ser Lys Ile Gln Pro Pro Thr
340 345 350

Gly Ala Ala Ala Asp Gly Tyr Ala Ala Ser Trp Thr Val Pro Phe Ala
355 360 365

Ala Arg Ala Tyr Val Glu Leu Leu Arg Cys Glu Thr Glu Thr Ser Ser
370 375 380

Glu Glu Glu Glu Glu Gly Glu Asp Glu Pro Phe Val Arg Val Leu Val
385 390 395 400

Asn Asp Arg Val Val Pro Leu His Gly Cys Arg Val Asp Arg Trp Gly
405 410 415

Arg Cys Arg Arg Asp Glu Trp Ile Lys Gly Leu Thr Phe Ala Arg Gln
420 425 430

Gly Gly His Trp Asp Arg Cys Phe
435 440



<210> 24 
<211> 441 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 10 
<400> 24 

Asn Ser His Ser Cys Asp Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro
1 5 10 15

Glu Ile Ser His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Ala
20 25 30

Asp Glu Ser Ala Ile Ser Pro Asp Val Pro Lys Gly Cys Arg Val Thr
35 40 45

Phe Val Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser
50 55 60

Lys Ser Lys Lys Tyr Ser Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala
65 70 75 80

Thr Ala Phe Lys Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr
85 90 95

Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gln Gln Met Val Asn
100 105 110

Ser Gly Ile Lys Phe Tyr Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile
115 120 125

Val Pro Phe Val Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Ala
130 135 140

Glu Lys Phe Ile Glu Gly Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly
145 150 155 160

Ala Asn Pro His Gln Ala Ser Pro Val Ile Asn Val Ile Ile Pro Glu
165 170 175

Gly Ala Gly Tyr Asn Asn Thr Leu Asp His Gly Leu Cys Thr Ala Phe
180 185 190

Glu Glu Ser Glu Leu Gly Asp Asp Val Glu Ala Asn Phe Thr Ala Val
195 200 205

Phe Ala Pro Pro Ile Arg Ala Arg Leu Glu Ala His Leu Pro Gly Val
210 215 220

Asn Leu Thr Asp Glu Asp Val Val Asn Leu Met Asp Met Cys Pro Phe
225 230 235 240

Asp Thr Val Ala Arg Thr Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys
245 250 255

Asp Leu Phe Thr His Asp Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser
260 265 270

Leu Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala
275 280 285

Gln Gly Val Gly Phe Val Asn Glu Leu Ile Ala Arg Leu Thr His Ser
290 295 300

Pro Val Gln Asp His Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro
305 310 315 320

Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp
325 330 335

Asn Thr Met Val Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr
340 345 350

Lys Pro Leu Ser Thr Thr Ser Val Glu Ser Ile Glu Glu Thr Asp Gly
355 360 365

Tyr Ala Ala Ser Trp Thr Val Pro Phe Ala Ala Arg Ala Tyr Val Glu
370 375 380

Met Met Gln Cys Glu Ala Glu Lys Glu Pro Leu Val Arg Val Leu Val
385 390 395 400

Asn Asp Arg Val Val Pro Leu His Gly Cys Gly Val Asp Lys Leu Gly
405 410 415

Arg Cys Lys Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ser
420 425 430

Gly Gly Asn Trp Glu Glu Cys Phe Ala
435 440



<210> 25 
<211> 1426 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 10

<220>
<221> CDS
<222> (12)..(1412)

<220>
<221> mat_peptide
<222> (90)..(1412)

<220>
<221> sig_peptide
<222> (12)..(89)

<400> 25 

tatatgaatt c atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg 50
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu
-25 -20 -15

ttc ggt tcc aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac 98
Phe Gly Ser Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His
-10 -5 -1 1

tct tgt gac act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct 146
Ser Cys Asp Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser
5 10 15

cac ttg tgg ggt caa tac tct cca ttc ttc tct ttg gct gac gaa tct 194
His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser
20 25 30 35

gct att tct cca gac gtt cca aag ggt tgt aga gtt act ttc gtt caa 242
Ala Ile Ser Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln
40 45 50

gtt ttg tct aga cac ggt gct aga tac cca act tct tct aag tct aag 290
Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys
55 60 65

aag tac tct gct ttg att gaa gct att caa aag aac gct act gct ttc 338
Lys Tyr Ser Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe
70 75 80

aag ggt aag tac gct ttc ttg aag act tac aac tac act ttg ggt gct 386
Lys Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala
85 90 95

gac gac ttg act cca ttc ggt gaa caa caa atg gtt aac tct ggt att 434
Asp Asp Leu Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile
100 105 110 115

aag ttc tac aga aga tac aag gct ttg gct aga aag att gtt cca ttc 482
Lys Phe Tyr Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe
120 125 130

gtt aga gct tct ggt tct gac aga gtt att gct tct gct gaa aag ttc 530
Val Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe
135 140 145

att gaa ggt ttc caa tct gct aag ttg gct gac cca ggt gct aac cca 578
Ile Glu Gly Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro
150 155 160

cac caa gct tct cca gtt att aac gtt att att cca gaa ggt gct ggt 626
His Gln Ala Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly
165 170 175

tac aac aac act ttg gac cac ggt ttg tgt act gct ttc gaa gaa tct 674
Tyr Asn Asn Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser
180 185 190 195

gaa ttg ggt gac gac gtt gaa gct aac ttc act gct gtt ttc gct cca 722
Glu Leu Gly Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro
200 205 210

cct att aga gct aga ttg gaa gct cac ttg cca ggt gtt aac ttg act 770
Pro Ile Arg Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr
215 220 225

gac gaa gac gtt gtt aac ttg atg gac atg tgt cca ttc gac act gtt 818
Asp Glu Asp Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val
230 235 240

gct aga act tct gac gct act caa ttg tct cca ttc tgt gac ttg ttc 866
Ala Arg Thr Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe
245 250 255

act cac gac gaa tgg att caa tac gac tac ttg caa tct ttg ggt aag 914
Thr His Asp Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys
260 265 270 275

tac tac ggt tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt 962
Tyr Tyr Gly Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val
280 285 290

ggt ttc gtt aac gaa ttg att gct aga ttg act cac tct cca gtt caa 1010
Gly Phe Val Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln
295 300 305

gac cac act tct act aac cac act ttg gac tct aac cca gct act ttc 1058
Asp His Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe
310 315 320

cca ttg aac gct act ttg tac gct gac ttc tct cac gac aac act atg 1106
Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met
325 330 335

gtt tct att ttc ttc gct ttg ggt ttg tac aac ggt act aag cca ttg 1154
Val Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu
340 345 350 355

tct act act tct gtt gaa tct att gaa gaa act gac ggt tac gct gct 1202
Ser Thr Thr Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ala Ala
360 365 370

tct tgg act gtt cca ttc gct gct aga gct tac gtt gaa atg atg caa 1250
Ser Trp Thr Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln
375 380 385

tgt gaa gct gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga 1298
Cys Glu Ala Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg
390 395 400

gtt gtt cca ttg cac ggt tgt ggt gtt gac aag ttg ggt aga tgt aag 1346
Val Val Pro Leu His Gly Cys Gly Val Asp Lys Leu Gly Arg Cys Lys
405 410 415

aga gac gac ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac 1394
Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn
420 425 430 435

tgg gaa gaa tgt ttc gct taagaattca tata 1426
Trp Glu Glu Cys Phe Ala
440



<210> 26 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: consensus 
phytase 10 

<400> 26 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-10 -5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20

Gly Gln Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
25 30 35

Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Ser
40 45 50

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Lys Tyr Ser
55 60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100

Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
105 110 115

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Val Arg Ala
120 125 130

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
135 140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
170 175 180

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Glu Leu Gly
185 190 195

Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
200 205 210

Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
215 220 225 230

Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
250 255 260

Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
280 285 290

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
295 300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile
330 335 340

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
345 350 355

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ala Ala Ser Trp Thr
360 365 370

Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala
375 380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Gly Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
425 430 435

Cys Phe Ala
440



<210> 27 
<211> 437 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 11 



<400> 27 
Asn Ser His Ser Cys Asp Thr Val Asp Gly Tyr Gln Cys Pro Glu Ile
1 5 10 15

Ser His Leu Trp Gly Gln Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu
20 25 30

Ser Ala Ile Ser Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val
35 40 45

Gln Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser
50 55 60

Lys Lys Tyr Ser Ala Leu Ile Glu Arg Ile Gln Lys Asn Ala Thr Phe
65 70 75 80

Lys Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala
85 90 95

Asp Asp Leu Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile
100 105 110

Lys Phe Tyr Arg Arg Tyr Lys Ala Leu Ala Arg Asn Ile Val Pro Phe
115 120 125

Val Arg Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe
130 135 140

Ile Glu Gly Phe Gln Ser Ala Lys Leu Ala Asp Pro Ala His Gln Ala
145 150 155 160

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
165 170 175

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Asp Ser Thr Leu Gly
180 185 190

Asp Asp Ala Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
195 200 205

Ala Arg Leu Glu Ala Leu Pro Gly Val Asn Leu Thr Asp Glu Asp Val
210 215 220

Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr Ser
225 230 235 240

Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr Ala Asp Glu
245 250 255

Trp Gln Tyr Asp Tyr Leu Gln Ser Leu Lys Tyr Tyr Gly Tyr Gly Ala
260 265 270

Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Asn Glu Leu Ile
275 280 285

Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr Ser Thr Asn His
290 295 300

Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr
305 310 315 320

Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile Phe Phe Ala Leu
325 330 335

Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr Ser Val Glu Ser
340 345 350

Ile Glu Thr Asp Gly Tyr Ala Ala Ser Trp Thr Val Pro Phe Ala Ala
355 360 365

Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala Gly Gly Gly Gly Gly
370 375 380

Glu Gly Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val
385 390 395 400

Val Pro Leu His Gly Cys Gly Val Asp Lys Leu Gly Arg Cys Lys Leu
405 410 415

Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp
420 425 430

Ala Glu Cys Phe Ala
435



<210> 28 
<211> 1404 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 1 thermo 8 q50t, k91a

<220>
<221> CDS
<222> (1)..(1401)

<220>
<221> mat_peptide
<222> (79)..(1401)

<220>
<221> sig_peptide
<222> (1)..(78)

<400> 28 

atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg ttc ggt tcc 48
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac tct tgt gac 96
Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-10 -5 -1 1 5

act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct cac ttg tgg 144
Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20

ggt acc tac tct cca tac ttc tct ttg gca gac gaa tct gct att tct 192
Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
25 30 35

cca gac gtt cca gac gac tgt aga gtt act ttc gtt caa gtt ttg tct 240
Pro Asp Val Pro Asp Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
40 45 50

aga cac ggt gct aga tac cca act tct tct gcg tct aag gct tac tct 288
Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
55 60 65 70

gct ttg att gaa gct att caa aag aac gct act gct ttc aag ggt aag 336
Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

tac gct ttc ttg aag act tac aac tac act ttg ggt gct gac gac ttg 384
Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100

act cca ttc ggt gaa aac caa atg gtt aac tct ggt att aag ttc tac 432
Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
105 110 115

aga aga tac aag gct ttg gct aga aag att gtt cca ttc att aga gct 480
Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
120 125 130

tct ggt tct gac aga gtt att gct tct gct gaa aag ttc att gaa ggt 528
Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
135 140 145 150

ttc caa tct gct aag ttg gct gac cca ggt tct caa cca cac caa gct 576
Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

tct cca gtt att aac gtg atc att cca gaa gga tcc ggt tac aac aac 624
Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
170 175 180

act ttg gac cac ggt act tgt act gct ttc gaa gac tct gaa tta ggt 672
Thr Leu Asp His Gly Thr Cys Thr Ala Phe Glu Asp Ser Glu Leu Gly
185 190 195

gac gac gtt gaa gct aac ttc act gct ttg ttc gct cca gct att aga 720
Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
200 205 210

gct aga ttg gaa gct gac ttg cca ggt gtt act ttg act gac gaa gac 768
Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
215 220 225 230

gtt gtt tac ttg atg gac atg tgt cca ttc gac act gtc gct aga act 816
Val Val Tyr Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

tct gac gct act gaa ttg tct cca ttc tgt gct ttg ttc act cac gac 864
Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260

gaa tgg atc caa tac gac tac ttg caa agc ttg ggt aag tac tac ggt 912
Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt ggt ttc gct 960
Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Ala
280 285 290

aac gaa ttg att gct aga ttg act cac tct cca gtt caa gac cac act 1008
Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
295 300 305 310

tct act aac cac act ttg gac tct aac cca gct act ttc cca ttg aac 1056
Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

gct act ttg tac gct gac ttc tct cac gac aac act atg ata tct att 1104
Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Ile Ser Ile
330 335 340

ttc ttc gct ttg ggt ttg tac aac ggt acc aag cca ttg tct act act 1152
Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
345 350 355

tct gtt gaa tct att gaa gaa act gac ggt tac tct gct tct tgg act 1200
Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
360 365 370

gtt cca ttc gct gct aga gct tac gtt gaa atg atg caa tgt caa gct 1248
Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
375 380 385 390

gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga gtt gtt cca 1296
Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

ttg cac ggt tgt gct gtt gac aag ttg ggt aga tgt aag aga gac gac 1344
Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420

ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac tgg gct gaa 1392
Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
425 430 435

tgt ttc gct taa 1404
Cys Phe Ala
440



<210> 29 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: consensus 
phytase 1 thermo 8 q50t, k91a 

<400> 29 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-10 -5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20

Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
25 30 35

Pro Asp Val Pro Asp Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
40 45 50

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
55 60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100

Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
105 110 115

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
120 125 130

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
135 140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
170 175 180

Thr Leu Asp His Gly Thr Cys Thr Ala Phe Glu Asp Ser Glu Leu Gly
185 190 195

Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
200 205 210

Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
215 220 225 230

Val Val Tyr Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260

Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Ala
280 285 290

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
295 300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Ile Ser Ile
330 335 340

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
345 350 355

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
360 365 370

Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
375 380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
425 430 435

Cys Phe Ala
440



<210> 30
<211> 1404
<212> DNA

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 10 thermo 3 q50t, k91a

<220>
<221> CDS
<222> (1)..(1401)

<220>
<221> mat_peptide
<222> (79)..(1401)

<220>
<221> sig_peptide
<222> (1)..(78)



<400> 30 

atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg ttc ggt tcc 48
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

aca tcc ggt acc gcc ttg ggt cct cgt ggt aac tct cac tct tgt gac 96
Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-10 -5 -1 1 5

act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct cac ttg tgg 144
Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20

ggt aca tac tct cca ttc ttc tct ttg gct gac gaa tct gct att tct 192
Gly Thr Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
25 30 35

cca gac gtt cca aag ggt tgt aga gtt act ttc gtt caa gtt ttg tct 240
Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Ser
40 45 50

aga cac ggt gct aga tac cca act tct tct gcg tct aag gcg tac tct 288
Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
55 60 65 70

gct ttg att gaa gct att caa aag aac gct act gct ttc aag ggt aag 336
Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

tac gct ttc ttg aag act tac aac tac act ttg ggt gct gac gac ttg 384
Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100

act cca ttc ggt gaa caa caa atg gtt aac tct ggt att aag ttc tac 432
Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
105 110 115

aga aga tac aag gct ttg gct aga aag att gtt cca ttc att aga gct 480
Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
120 125 130

tct ggt tct gac aga gtt att gct tct gct gaa aag ttc att gaa ggt 528
Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
135 140 145 150

ttc caa tct gct aag ttg gct gac cca ggt gct aac cca cac caa gct 576
Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
155 160 165

tct cca gtt att aac gtt att att cca gaa ggt gct ggt tac aac aac 624
Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
170 175 180

act ttg gac cac ggt ttg tgt act gct ttc gaa gaa tct gaa ttg ggt 672
Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Glu Leu Gly
185 190 195

gac gac gtt gaa gct aac ttc act gct gtt ttc gct cca cca att aga 720
Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
200 205 210

gct aga ttg gaa gct cac ttg cca ggt gtt aac ttg act gac gaa gac 768
Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
215 220 225 230

gtt gtt aac ttg atg gac atg tgt cca ttc gac act gtt gct aga act 816
Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

tct gac gct act caa ttg tct cca ttc tgt gac ttg ttc act cac gac 864
Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
250 255 260

gaa tgg att caa tac gac tac ttg caa tct ttg ggt aag tac tac ggt 912
Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt ggt ttc gtt 960
Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
280 285 290

aac gaa ttg att gct aga ttg act cac tct cca gtt caa gac cac act 1008
Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
295 300 305 310

tct act aac cac act ttg gac tct aac cca gct act ttc cca ttg aac 1056
Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

gct act ttg tac gct gac ttc tct cac gac aac act atg gtt tct att 1104
Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile
330 335 340

ttc ttc gct ttg ggt ttg tac aac ggt act aag cca ttg tct act act 1152
Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
345 350 355

tct gtt gaa tct att gaa gaa act gac ggt tac tct gct tct tgg act 1200
Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
360 365 370

gtt cca ttc gct gct aga gct tac gtt gaa atg atg caa tgt gaa gct 1248
Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala
375 380 385 390

gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga gtt gtt cca 1296
Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

ttg cac ggt tgt ggt gtt gac aag ttg ggt aga tgt aag aga gac gac 1344
Leu His Gly Cys Gly Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420

ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac tgg gaa gaa 1392
Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
425 430 435

tgt ttc gct taa 1404
Cys Phe Ala
440



<210> 31 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: consensus 
phytase 10 thermo 3 q50t, k91a 

<400> 31 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-10 -5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20

Gly Thr Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
25 30 35




Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Ser
40 45 50

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
55 60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100

Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
105 110 115

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
120 125 130

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
135 140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
170 175 180

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Glu Leu Gly
185 190 195

Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
200 205 210

Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
215 220 225 230

Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
250 255 260

Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
280 285 290

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
295 300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile
330 335 340

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
345 350 355

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
360 365 370

Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala
375 380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Gly Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
425 430 435

Cys Phe Ala
440


<210> 32 
<211> 1404 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Aspergillus 
fumigatus alpha-mutant 

<220> 

<221> CDS
<222> (1)..(1401)

<220>
<221> mat_peptide
<222> (79)..(1401)

<220>
<221> sig_peptide
<222> (1)..(78)

<400> 32 

atg ggg gtt ttc gtc gtt cta tta tct atc gcg act ctg ttc ggc agc 48
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

aca tcg ggc act gcg ctg ggc ccc cgt gga aat cac tcc aag tcc tgc 96
Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn His Ser Lys Ser Cys
-10 -5 -1 1 5

gat acg gta gac cta ggg tac cag tgc tcc cct gcg act tct cat cta 144
Asp Thr Val Asp Leu Gly Tyr Gln Cys Ser Pro Ala Thr Ser His Leu
10 15 20

tgg ggc acg tac tcg cca tac ttt tcg ctc gag gac gag ctg tcc gtg 192
Trp Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Glu Asp Glu Leu Ser Val
25 30 35

tcg agt aag ctt ccc aag gat tgc cgg atc acc ttg gta cag gtg cta 240
Ser Ser Lys Leu Pro Lys Asp Cys Arg Ile Thr Leu Val Gln Val Leu
40 45 50

tcg cgc cat gga gcg cgg tac cca acc agc tcc aag agc aaa aag tat 288
Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Lys Tyr
55 60 65 70

aag aag ctt att acg gcg atc cag gcc aat gcc acc gac ttc aag ggc 336
Lys Lys Leu Ile Thr Ala Ile Gln Ala Asn Ala Thr Asp Phe Lys Gly
75 80 85

aag tac gcc ttt ttg aag acg tac aac tat act ctg ggt gcg gat gac 384
Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp
90 95 100

ctc act ccc ttt ggg gag cag cag ctg gtg aac tcg ggc atc aag ttc 432
Leu Thr Pro Phe Gly Glu Gln Gln Leu Val Asn Ser Gly Ile Lys Phe
105 110 115

tac cag agg tac aag gct ctg gcg cgc agt gtg gtg ccg ttt att cgc 480
Tyr Gln Arg Tyr Lys Ala Leu Ala Arg Ser Val Val Pro Phe Ile Arg
120 125 130

gcc tca ggc tcg gac cgg gtt att gct tcg gga gag aag ttc atc gag 528
Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Gly Glu Lys Phe Ile Glu
135 140 145 150

ggg ttc cag cag gcg aag ctg gct gat cct ggc gcg acg aac cgc gcc 576
Gly Phe Gln Gln Ala Lys Leu Ala Asp Pro Gly Ala Thr Asn Arg Ala
155 160 165

gct ccg gcg att agt gtg att att ccg gag agc gag acg ttc aac aat 624
Ala Pro Ala Ile Ser Val Ile Ile Pro Glu Ser Glu Thr Phe Asn Asn
170 175 180

acg ctg gac cac ggt gtg tgc acg aag ttt gag gcg agt cag ctg gga 672
Thr Leu Asp His Gly Val Cys Thr Lys Phe Glu Ala Ser Gln Leu Gly
185 190 195

gat gag gtt gcg gcc aat ttc act gcg ctc ttt gca ccc gac atc cga 720
Asp Glu Val Ala Ala Asn Phe Thr Ala Leu Phe Ala Pro Asp Ile Arg
200 205 210

gct cgc ctc gag aag cat ctt cct ggc gtg acg ctg aca gac gag gac 768
Ala Arg Leu Glu Lys His Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
215 220 225 230

gtt gtc agt cta atg gac atg tgt ccg ttt gat acg gta gcg cgc acc 816
Val Val Ser Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

agc gac gca agt cag ctg tca ccg ttc tgt caa ctc ttc act cac aat 864
Ser Asp Ala Ser Gln Leu Ser Pro Phe Cys Gln Leu Phe Thr His Asn
250 255 260

gag tgg aag aag tac gac tac ctt cag tcc ttg ggc aag tac tac ggc 912
Glu Trp Lys Lys Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

tac ggc gca ggc aac cct ctg gga ccg gct cag ggg ata ggg ttc acc 960
Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Ile Gly Phe Thr
280 285 290

aac gag ctg att gcc cgg ttg acg cgt tcg cca gtg cag gac cac acc 1008
Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val Gln Asp His Thr
295 300 305 310

agc act aac tcg act cta gtc tcc aac ccg gcc acc ttc ccg ttg aac 1056
Ser Thr Asn Ser Thr Leu Val Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

gct acc atg tac gtc gac ttt tca cac gac aac agc atg gtt tcc atc 1104
Ala Thr Met Tyr Val Asp Phe Ser His Asp Asn Ser Met Val Ser Ile
330 335 340

ttc ttt gca ttg ggc ctg tac aac ggc act gaa ccc ttg tcc cgg acc 1152
Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Glu Pro Leu Ser Arg Thr
345 350 355

tcg gtg gaa agc gcc aag gaa ttg gat ggg tat tct gca tcc tgg gtg 1200
Ser Val Glu Ser Ala Lys Glu Leu Asp Gly Tyr Ser Ala Ser Trp Val
360 365 370

gtg cct ttc ggc gcg cga gcc tac ttc gag acg atg caa tgc aag tcg 1248
Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Thr Met Gln Cys Lys Ser
375 380 385 390

gaa aag gag cct ctt gtt cgc gct ttg att aat gac cgg gtt gtg cca 1296
Glu Lys Glu Pro Leu Val Arg Ala Leu Ile Asn Asp Arg Val Val Pro
395 400 405

ctg cat ggc tgc gat gtg gac aag ctg ggg cga tgc aag ctg aat gac 1344
Leu His Gly Cys Asp Val Asp Lys Leu Gly Arg Cys Lys Leu Asn Asp
410 415 420

ttt gtc aag gga ttg agt tgg gcc aga tct ggg ggc aac tgg gga gag 1392
Phe Val Lys Gly Leu Ser Trp Ala Arg Ser Gly Gly Asn Trp Gly Glu
425 430 435

tgc ttt agt tga 1404
Cys Phe Ser
440



<210> 33 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Aspergillus 
fumigatus alpha-mutant 

<400> 33 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn His Ser Lys Ser Cys
-10 -5 -1 1 5

Asp Thr Val Asp Leu Gly Tyr Gln Cys Ser Pro Ala Thr Ser His Leu
10 15 20

Trp Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Glu Asp Glu Leu Ser Val
25 30 35

Ser Ser Lys Leu Pro Lys Asp Cys Arg Ile Thr Leu Val Gln Val Leu
40 45 50

Ser Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Lys Tyr
55 60 65 70

Lys Lys Leu Ile Thr Ala Ile Gln Ala Asn Ala Thr Asp Phe Lys Gly
75 80 85

Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp
90 95 100

Leu Thr Pro Phe Gly Glu Gln Gln Leu Val Asn Ser Gly Ile Lys Phe
105 110 115

Tyr Gln Arg Tyr Lys Ala Leu Ala Arg Ser Val Val Pro Phe Ile Arg
120 125 130

Ala Ser Gly Ser Asp Arg Val Ile Ala Ser Gly Glu Lys Phe Ile Glu
135 140 145 150

Gly Phe Gln Gln Ala Lys Leu Ala Asp Pro Gly Ala Thr Asn Arg Ala
155 160 165

Ala Pro Ala Ile Ser Val Ile Ile Pro Glu Ser Glu Thr Phe Asn Asn
170 175 180

Thr Leu Asp His Gly Val Cys Thr Lys Phe Glu Ala Ser Gln Leu Gly
185 190 195

Asp Glu Val Ala Ala Asn Phe Thr Ala Leu Phe Ala Pro Asp Ile Arg
200 205 210

Ala Arg Leu Glu Lys His Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
215 220 225 230

Val Val Ser Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Ser Gln Leu Ser Pro Phe Cys Gln Leu Phe Thr His Asn
250 255 260

Glu Trp Lys Lys Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
265 270 275

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Ile Gly Phe Thr
280 285 290

Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val Gln Asp His Thr
295 300 305 310

Ser Thr Asn Ser Thr Leu Val Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Met Tyr Val Asp Phe Ser His Asp Asn Ser Met Val Ser Ile
330 335 340

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Glu Pro Leu Ser Arg Thr
345 350 355

Ser Val Glu Ser Ala Lys Glu Leu Asp Gly Tyr Ser Ala Ser Trp Val
360 365 370

Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu Thr Met Gln Cys Lys Ser
375 380 385 390

Glu Lys Glu Pro Leu Val Arg Ala Leu Ile Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Asp Val Asp Lys Leu Gly Arg Cys Lys Leu Asn Asp
410 415 420

Phe Val Lys Gly Leu Ser Trp Ala Arg Ser Gly Gly Asn Trp Gly Glu
425 430 435

Cys Phe Ser
440



<210> 34 
<211> 1426 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 7 

<220> 

<221> CDS
<222> (12)..(1412)

<220>
<221> mat_peptide
<222> (90)..(1412)

<220>
<221> sig_peptide
<222> (12)..(89)

<400> 34 

tatatgaatt c atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg 50
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu
-25 -20 -15

ttc ggt tcc aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac 98
Phe Gly Ser Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His
-10 -5 -1 1

tct tgt gac act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct 146
Ser Cys Asp Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser
5 10 15

cac ttg tgg ggt caa tac tct cca tac ttc tct ttg gaa gac gaa tct 194
His Leu Trp Gly Gln Tyr Ser Pro Tyr Phe Ser Leu Glu Asp Glu Ser
20 25 30 35

gct att tct cca gac gtt cca gac gac tgt aga gtt act ttc gtt caa 242
Ala Ile Ser Pro Asp Val Pro Asp Asp Cys Arg Val Thr Phe Val Gln
40 45 50

gtt ttg tct aga cac ggt gct aga tac cca act gac tct aag ggt aag 290
Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Asp Ser Lys Gly Lys
55 60 65

aag tac tct gct ttg att gaa gct att caa aag aac gct act gct ttc 338
Lys Tyr Ser Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe
70 75 80

aag ggt aag tac gct ttc ttg aag act tac aac tac act ttg ggt gct 386
Lys Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala
85 90 95

gac gac ttg act cca ttc ggt gaa aac caa atg gtt aac tct ggt att 434
Asp Asp Leu Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile
100 105 110 115

aag ttc tac aga aga tac aag gct ttg gct aga aag att gtt cca ttc 482
Lys Phe Tyr Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe
120 125 130

att aga gct tct ggt tct tct aga gtt att gct tct gct gaa aag ttc 530
Ile Arg Ala Ser Gly Ser Ser Arg Val Ile Ala Ser Ala Glu Lys Phe
135 140 145

att gaa ggt ttc caa tct gct aag ttg gct gac cca ggt tct caa cca 578
Ile Glu Gly Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro
150 155 160

cac caa gct tct cca gtt att gac gtt att att tct gac gct tct tct 626
His Gln Ala Ser Pro Val Ile Asp Val Ile Ile Ser Asp Ala Ser Ser
165 170 175

tac aac aac act ttg gac cca ggt act tgt act gct ttc gaa gac tct 674
Tyr Asn Asn Thr Leu Asp Pro Gly Thr Cys Thr Ala Phe Glu Asp Ser
180 185 190 195

gaa ttg gct gac act gtt gaa gct aac ttc act gct ttg ttc gct cca 722
Glu Leu Ala Asp Thr Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro
200 205 210

gct att aga gct aga ttg gaa gct gac ttg cca ggt gtt act ttg act 770
Ala Ile Arg Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr
215 220 225

gac act gaa gtt act tac ttg atg gac atg tgt tct ttc gaa act gtt 818
Asp Thr Glu Val Thr Tyr Leu Met Asp Met Cys Ser Phe Glu Thr Val
230 235 240

gct aga act tct gac gct act gaa ttg tct cca ttc tgt gct ttg ttc 866
Ala Arg Thr Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe
245 250 255

act cac gac gaa tgg aga cac tac gac tac ttg caa tct ttg aag aag 914
Thr His Asp Glu Trp Arg His Tyr Asp Tyr Leu Gln Ser Leu Lys Lys
260 265 270 275

tac tac ggt cac ggt gct ggt aac cca ttg ggt cca act caa ggt gtt 962
Tyr Tyr Gly His Gly Ala Gly Asn Pro Leu Gly Pro Thr Gln Gly Val
280 285 290

ggt ttc gct aac gaa ttg att gct aga ttg act aga tct cca gtt caa 1010
Gly Phe Ala Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val Gln
295 300 305

gac cac act tct act aac cac act ttg gac tct aac cca gct act ttc 1058
Asp His Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe
310 315 320

cca ttg aac gct act ttg tac gct gac ttc tct cac gac aac ggt att 1106
Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Gly Ile
325 330 335

att tct att ttc ttc gct ttg ggt ttg tac aac ggt act gct cca ttg 1154
Ile Ser Ile Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Ala Pro Leu
340 345 350 355

tct act act tct gtt gaa tct att gaa gaa act gac ggt tac tct tct 1202
Ser Thr Thr Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ser
360 365 370

gct tgg act gtt cca ttc gct tct aga gct tac gtt gaa atg atg caa 1250
Ala Trp Thr Val Pro Phe Ala Ser Arg Ala Tyr Val Glu Met Met Gln
375 380 385

tgt caa gct gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga 1298
Cys Gln Ala Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg
390 395 400

gtt gtt cca ttg cac ggt tgt gct gtt gac aag ttg ggt aga tgt aag 1346
Val Val Pro Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys
405 410 415

aga gac gac ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac 1394
Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn
420 425 430 435

tgg gct gaa tgt ttc gct taagaattca tata 1426
Trp Ala Glu Cys Phe Ala
440



<210> 35 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: consensus 
phytase 7 

<400> 35 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-25 -20 -15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-10 -5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20

Gly Gln Tyr Ser Pro Tyr Phe Ser Leu Glu Asp Glu Ser Ala Ile Ser
25 30 35

Pro Asp Val Pro Asp Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
40 45 50

Arg His Gly Ala Arg Tyr Pro Thr Asp Ser Lys Gly Lys Lys Tyr Ser
55 60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100

Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
105 110 115

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
120 125 130

Ser Gly Ser Ser Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
135 140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asp Val Ile Ile Ser Asp Ala Ser Ser Tyr Asn Asn
170 175 180

Thr Leu Asp Pro Gly Thr Cys Thr Ala Phe Glu Asp Ser Glu Leu Ala
185 190 195

Asp Thr Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
200 205 210

Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Thr Glu
215 220 225 230

Val Thr Tyr Leu Met Asp Met Cys Ser Phe Glu Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260

Glu Trp Arg His Tyr Asp Tyr Leu Gln Ser Leu Lys Lys Tyr Tyr Gly
265 270 275

His Gly Ala Gly Asn Pro Leu Gly Pro Thr Gln Gly Val Gly Phe Ala
280 285 290

Asn Glu Leu Ile Ala Arg Leu Thr Arg Ser Pro Val Gln Asp His Thr
295 300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Gly Ile Ile Ser Ile
330 335 340

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Ala Pro Leu Ser Thr Thr
345 350 355

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ser Ala Trp Thr
360 365 370

Val Pro Phe Ala Ser Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
375 380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
425 430 435

Cys Phe Ala
440



<210> 36 
<211> 467 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
phytase 12 

<400> 36 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
1 5 10 15

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
20 25 30

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser Ser Asn Trp
35 40 45

Ser Pro Tyr Ser Pro Tyr Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
50 55 60

Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Gln
65 70 75 80

Arg His Gly Ala Arg Phe Pro Thr Ser Gly Ala Ala Thr Arg Ile Ser
85 90 95

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
100 105 110

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
115 120 125

Val Pro Phe Gly Ala Asn Gln Ser Ser Gln Ala Gly Ile Lys Phe Tyr
130 135 140

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
145 150 155 160

Ser Gly Ser Asp Arg Val Ile Asp Ser Ala Thr Asn Trp Ile Glu Gly
165 170 175

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
180 185 190

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
195 200 205

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Glu Leu Gly
210 215 220

Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
225 230 235 240

Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
245 250 255

Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
260 265 270

Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
275 280 285

Glu Trp Ile Gln Tyr Asp Tyr Leu Gly Asp Leu Asp Lys Tyr Tyr Gly
290 295 300

Thr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
305 310 315 320

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
325 330 335

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
340 345 350

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ala Ile
355 360 365

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
370 375 380

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Leu
385 390 395 400

Val Pro Phe Ser Ala Arg Met Tyr Val Glu Met Met Gln Cys Glu Ala
405 410 415

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
420 425 430

Leu His Gly Cys Gly Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
435 440 445

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
450 455 460

Cys Phe Ala
465


<210> 37 

<211> 26 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

tatatgaatt catgggcgtg ttcgtc 



<210> 38 

<211> 22 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

tgaaaagttc attgaaggtt tc 



<210> 39 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 
<400> 39 

tcttcgaaag cagtacacaa ac 



<210> 40 

<211> 22 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer
<400> 40 

tatatgaatt cttaagcgaa ac 



<210> 41 

<211> 32 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

cacttgtggg gtacctactc tccatacttc tc 



<210> 42 

<211> 31 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

ggtcaatact ctccattctt ctctttggaa g 



<210> 43 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 43 

catacttctc tttggcagac gaatctgc 28

<210> 44
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 44 

ctccagacgt cccaaaggac tgtagagtta c 

<210> 45 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 45 

ctccagacgt cccagacggc tgtagagtta c 

<210> 46 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 46 

gatacccaac ttcttctgcg tctaaggctt actctg 

<210> 47 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 47 

cttctaagtc taagaagtac tctgctttg 

<210> 48 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 



<400> 48 

gcttactctg ctttgattga acggattcaa aagaacgcta c 41



<210> 49 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 49 

ccattcggtg aacagcaaat ggttaactc 29 



<210> 50
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 50

gatacaaggc tctcgcgaga aacattgttc 30

<210> 51
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 51

gattgttcca ttcgtgcgcg cttctggttc 30

<210> 52
<211> 31
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 52

ctccagttat taacgtgatc attccagaag g 31

<210> 53
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 53 

ggctgaccca ggggcccaac cacaccaagc 30



<210> 54 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 54 

cactttggac catggtcttt gtactgcttt cg 32



<210> 55 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 55 

gctttcgaag actctaccct aggtgacgac gttg 34 



<210> 56 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 56 

ggtgacgacg ctgaagctaa cttcac 26 



<210> 57 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 
<400> 57 

ctaacttcac cgcggtgttc gctccag 27 



<210> 58 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 58 

gctttgttcg ctccacctat tagagctaga ttgg 34 



<210> 59 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 59 

gccaggtgtt aacttgactg acgaag 26 



<210> 60 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 60 

gacgaagacg tcgttaactt gatggac 27 

<210> 61 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 61 

gtccattcga cactgtcgct agaacttc 28 
<210> 62

<211> 26 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 62 

ctgacgctac tcagctgtct ccattc 



<210> 63 

<211> 28 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 63 

gtctccattc tgtgatttgt tcactcac 



<210> 64 

<211> 26 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 64 

gctttgttca ccgcggacga atggag 



<210> 65 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 65 

cacgacgaat ggatccaata cgactac 



<210> 66 

<211> 27 

<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer 
<400> 66 

gacgaatgga gagcgtacga ctacttg 



<210> 67 

<211> 29 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 67 

ggtgttggtt tcgttaacga attgattgc 



<210> 68 

<211> 28 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 68 

gctagattga ctcactctcc agttcaag 



<210> 69 

<211> 32 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 69 

ctcacgacaa cactatgata tctattttct tc 32 



<210> 70 

<211> 30 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 70 

cgacaactcc atggtttcta ttttcttcgc 
<210> 71
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 71 

gtacaacggt accaagccat tgtctac 27 



<210> 72 

<211> 25 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 72 

ctgacggtta cgctgcttct tggac 



<210> 73 

<211> 26 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 73 

ctgttccatt cgctgctaga gcttac 



<210> 74 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 74 

gatgcaatgt gaagctgaaa aggaacc 27 



<210> 75 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 
<400> 75 

cacggttgtg gtgtcgacaa gttggg 26 

<210> 76 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 76

gatctggtgg caattgggag gaatgtttcg 30

<210> 77
<211> 28
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 77

cacgtactcg ccatactttt cgctcgag 28

<210> 78 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 78 

ccatactttt cgctcgcgga cgagctgtcc gtg 33 

<210> 79 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 79 

gtataagaag cttattacgg cgatccaggc c 31

<210> 80
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 80 

cttcaagggc aagtacgcct ttttgaagac g 



<210> 81 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 81 

catccgagct cgcctcgaga agcatcttc 



<210> 82 

<211> 29 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 82 

ctaatggatg tgtccgtttg atacggtag 



<210> 83 

<211> 27 

<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 83 

gtggaagaag tacgactacc ttcagtc 



<210> 84 

<211> 28 

<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer 
<400> 84 

gcccggttga cgcattcgcc agtgcagg 

<210> 85 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 85 

cacacgacaa caccatggtt tccatcttc 

<210> 86 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 86

gtggtgcctt tcgccgcgcg agcctacttc 30

<210> 87
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 87

tatatcatga gcgtgttcgt cgtgctactg ttc

<210> 88 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 88 

acccgactta caaagcgaat tctatagata tat 33

<210> 89
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 89 

acccttctta caaagcgaat tctatagata tat 33

<210> 90 
<211> 1404 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:
Consensus-phytase-3-thermo-11-Q50T

<220>
<221> CDS
<222> (1)..(1401)

<220>
<221> sig_peptide
<222> (1)..(69)

<220>
<221> mat_peptide
<222> (70)..(1401)

<400> 90 

atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg ttc ggt tcc 48
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac tct tgt gac 96
Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct cac ttg tgg 144
Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

ggt acc tac tct cca tac ttc tct ttg gca gac gaa tct gct att tct 192
Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

cca gac gtc cca aag gac tgt aga gtt act ttc gtt caa gtt ttg tct 240
Pro Asp Val Pro Lys Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

aga cac ggt gct aga tac cca act tct tct aag tct aag gct tac tct 288
Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Ala Tyr Ser
60 65 70

gct ttg att gaa gct att caa aag aac gct act gct ttc aag ggt aag 336
Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

tac gct ttc ttg aag act tac aac tac act ttg ggt gct gac gac ttg 384
Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

act cca ttc ggt gaa aac caa atg gtt aac tct ggt att aag ttc tac 432
Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

aga aga tac aag gct ttg gct aga aag att gtt cca ttc att aga gct 480
Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

tct ggt tct gac aga gtt att gct tct gct gaa aag ttc att gaa ggt 528
Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

ttc caa tct gct aag ttg gct gac cca ggt tct caa cca cac caa gct 576
Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

tct cca gtt att aac gtg atc att cca gaa gga tcc ggt tac aac aac 624
Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
170 175 180 185

act ttg gac cat ggt ctt tgt act gct ttc gaa gac tct acc cta ggt 672
Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Asp Ser Thr Leu Gly
190 195 200

gac gac gtt gaa gct aac ttc act gct ttg ttc gct cca gct att aga 720
Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
205 210 215

gct aga ttg gaa gct gac ttg cca ggt gtt act ttg act gac gaa gac 768
Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
220 225 230

gtt gtt tac ttg atg gac atg tgt cca ttc gac act gtc gct aga act 816
Val Val Tyr Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

tct gac gct act gaa ttg tct cca ttc tgt gct ttg ttc act cac gac 864
Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260 265

gaa tgg atc caa tac gac tac ttg caa agc ttg ggt aag tac tac ggt 912
Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt ggt ttc gct 960
Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Ala
285 290 295

aac gaa ttg att gct aga ttg act cac tct cca gtt caa gac cac act 1008
Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

tct act aac cac act ttg gac tct aac cca gct act ttc cca ttg aac 1056
Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

gct act ttg tac gct gac ttc tct cac gac aac act atg ata tct att 1104
Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Ile Ser Ile
330 335 340 345

ttc ttc gct ttg ggt ttg tac aac ggt acc aag cca ttg tct act act 1152
Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

tct gtt gaa tct att gaa gaa act gac ggt tac tct gct tct tgg act 1200
Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

gtt cca ttc gct gct aga gct tac gtt gaa atg atg caa tgt caa gct 1248
Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
380 385 390

gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga gtt gtt cca 1296
Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

ttg cac ggt tgt gct gtt gac aag ttg ggt aga tgt aag aga gac gac 1344
Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac tgg gct gaa 1392
Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
430 435 440

tgt ttc gct taa 1404
Cys Phe Ala



<210> 91 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence:
Consensus phytase-3-thermo-11-Q50T

<400> 91

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

Pro Asp Val Pro Lys Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Ala Tyr Ser
60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
170 175 180 185

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Asp Ser Thr Leu Gly
190 195 200

Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
205 210 215

Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
220 225 230

Val Val Tyr Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260 265

Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Ala
285 290 295

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Ile Ser Ile
330 335 340 345

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
430 435 440

Cys Phe Ala



<210> 92 
<211> 1404 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus
phytase-3-thermo-11-Q50T-K91A

<220>

<221> sig_peptide
<222> (1)..(69)

<220>

<221> CDS

<222> (1)..(1401)

<220>

<221> mat_peptide
<222> (70)..(1401)

<400> 92 

atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg ttc ggt tcc 48
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac tct tgt gac 96
Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct cac ttg tgg 144
Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

ggt acc tac tct cca tac ttc tct ttg gca gac gaa tct gct att tct 192
Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

cca gac gtc cca aag gac tgt aga gtt act ttc gtt caa gtt ttg tct 240
Pro Asp Val Pro Lys Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

aga cac ggt gct aga tac cca act tct tct gcg tct aag gct tac tct 288
Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
60 65 70

gct ttg att gaa gct att caa aag aac gct act gct ttc aag ggt aag 336
Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

tac gct ttc ttg aag act tac aac tac act ttg ggt gct gac gac ttg 384
Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

act cca ttc ggt gaa aac caa atg gtt aac tct ggt att aag ttc tac 432
Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

aga aga tac aag gct ttg gct aga aag att gtt cca ttc att aga gct 480
Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

tct ggt tct gac aga gtt att gct tct gct gaa aag ttc att gaa ggt 528
Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

ttc caa tct gct aag ttg gct gac cca ggt tct caa cca cac caa gct 576
Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

tct cca gtt att aac gtg atc att cca gaa gga tcc ggt tac aac aac 624
Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
170 175 180 185

act ttg gac cat ggt ctt tgt act gct ttc gaa gac tct acc cta ggt 672
Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Asp Ser Thr Leu Gly
190 195 200

gac gac gtt gaa gct aac ttc act gct ttg ttc gct cca gct att aga 720
Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
205 210 215

gct aga ttg gaa gct gac ttg cca ggt gtt act ttg act gac gaa gac 768
Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
220 225 230

gtt gtt tac ttg atg gac atg tgt cca ttc gac act gtc gct aga act 816
Val Val Tyr Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

tct gac gct act gaa ttg tct cca ttc tgt gct ttg ttc act cac gac 864
Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260 265

gaa tgg atc caa tac gac tac ttg caa agc ttg ggt aag tac tac ggt 912
Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt ggt ttc gct 960
Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Ala
285 290 295

aac gaa ttg att gct aga ttg act cac tct cca gtt caa gac cac act 1008
Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

tct act aac cac act ttg gac tct aac cca gct act ttc cca ttg aac 1056
Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

gct act ttg tac gct gac ttc tct cac gac aac act atg ata tct att 1104
Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Ile Ser Ile
330 335 340 345

ttc ttc gct ttg ggt ttg tac aac ggt acc aag cca ttg tct act act 1152
Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

tct gtt gaa tct att gaa gaa act gac ggt tac tct gct tct tgg act 1200
Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

gtt cca ttc gct gct aga gct tac gtt gaa atg atg caa tgt caa gct 1248
Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
380 385 390

gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga gtt gtt cca 1296
Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

ttg cac ggt tgt gct gtt gac aag ttg ggt aga tgt aag aga gac gac 1344
Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac tgg gct gaa 1392
Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
430 435 440

tgt ttc gct taa 1404
Cys Phe Ala



<210> 93 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Consensus
phytase-3-thermo-11-Q50T-K91A

<400> 93 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

Gly Thr Tyr Ser Pro Tyr Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

Pro Asp Val Pro Lys Asp Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

Thr Pro Phe Gly Glu Asn Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ser Gln Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ser Gly Tyr Asn Asn
170 175 180 185

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Asp Ser Thr Leu Gly
190 195 200

Asp Asp Val Glu Ala Asn Phe Thr Ala Leu Phe Ala Pro Ala Ile Arg
205 210 215

Ala Arg Leu Glu Ala Asp Leu Pro Gly Val Thr Leu Thr Asp Glu Asp
220 225 230

Val Val Tyr Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Glu Leu Ser Pro Phe Cys Ala Leu Phe Thr His Asp
250 255 260 265

Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Ala
285 290 295

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Ile Ser Ile
330 335 340 345

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Gln Ala
380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Ala Glu
430 435 440

Cys Phe Ala



<210> 94 
<211> 1404 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
phytase-10-thermo-5-Q50T 

<220> 

<221> sig_peptide
<222> (1)..(69)

<220>

<221> CDS

<222> (1)..(1401)

<220>

<221> mat_peptide
<222> (70)..(1401)

<400> 94 

atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg ttc ggt tcc 48
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac tct tgt gac 96
Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct cac ttg tgg 144
Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

ggt aca tac tct cca ttc ttc tct ttg gct gac gaa tct gct att tct 192
Gly Thr Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

cca gac gtt cca aag ggt tgt aga gtt act ttc gtt caa gtt ttg tct 240
Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

aga cac ggt gct aga tac cca act tct tct aag tct aag gct tac tct 288
Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Ala Tyr Ser
60 65 70

gct ttg att gaa gct att caa aag aac gct act gct ttc aag ggt aag 336
Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

tac gct ttc ttg aag act tac aat tac act ttg ggt gct gac gac ttg 384
Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

act cca ttc ggt gaa caa caa atg gtt aac tct ggt att aag ttc tac 432
Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

aga aga tac aag gct ttg gct aga aag att gtt cca ttc att aga gct 480
Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

tct ggt tct gac aga gtt att gct tct gcc gaa aag ttc att gaa ggt 528
Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

ttc caa tct gct aag ttg gct gac cca ggt gct aac cca cac caa gct 576
Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
155 160 165

tct cca gtt att aac gtt att att cca gaa ggt gct ggt tac aac aac 624
Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
170 175 180 185

act ttg gac cac ggt ttg tgt act gct ttc gaa gaa tct acc cta ggt 672
Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Thr Leu Gly
190 195 200

gac gac gtt gaa gct aac ttc act gct gtt ttc gct cca cca att aga 720
Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
205 210 215

gct aga ttg gaa gct cac ttg cca ggt gtt aac ttg act gac gaa gac 768
Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
220 225 230

gtt gtt aac ttg atg gac atg tgt cca ttc gac act gtt gct aga act 816
Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

tct gac gct act caa ttg tct cca ttc tgt gac ttg ttc act cac gac 864
Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
250 255 260 265

gaa tgg att caa tac gac tac ttg caa tct ttg ggt aag tac tac ggt 912
Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt ggt ttc gtt 960
Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
285 290 295

aac gaa ttg att gct aga ttg act cac tct cca gtt caa gac cac act 1008
Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

tct act aac cac act ttg gac tct aac cca gct act ttc cca ttg aac 1056
Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

gct act ttg tac gct gac ttc tct cac gac aac act atg gtt tct att 1104
Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile
330 335 340 345

ttc ttc gct ttg ggt ttg tac aac ggt act aag cca ttg tct act act 1152
Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

tct gtt gaa tct att gaa gaa act gac ggt tac tct gct tct tgg act 1200
Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

gtt cca ttc gct gct aga gct tac gtt gaa atg atg caa tgt gaa gct 1248
Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala
380 385 390

gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga gtt gtt cca 1296
Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

ttg cac ggt tgt gct gtt gac aag ttg ggt aga tgt aag aga gac gac 1344
Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac tgg gaa gaa 1392
Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
430 435 440

tgt ttc gct taa 1404
Cys Phe Ala



<210> 95 
<211> 467 




<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Consensus
phytase-10-thermo-5-Q50T

<400> 95 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

Gly Thr Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Ala Tyr Ser
60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
170 175 180 185

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Thr Leu Gly
190 195 200

Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
205 210 215

Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
220 225 230

Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
250 255 260 265

Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
285 290 295

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile
330 335 340 345

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala
380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
430 435 440

Cys Phe Ala



<210> 96 
<211> 1404 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus
phytase-10-thermo-5-Q50T-K91A

<220>

<221> sig_peptide
<222> (1)..(69)

<220>

<221> CDS

<222> (1)..(1401)

<220>

<221> mat_peptide
<222> (70)..(1401)

<400> 96 

atg ggc gtg ttc gtc gtg cta ctg tcc att gcc acc ttg ttc ggt tcc 48
Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

aca tcc ggt acc gcc ttg ggt cct cgt ggt aat tct cac tct tgt gac 96
Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

act gtt gac ggt ggt tac caa tgt ttc cca gaa att tct cac ttg tgg 144
Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

ggt aca tac tct cca ttc ttc tct ttg gct gac gaa tct gct att tct 192
Gly Thr Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

cca gac gtt cca aag ggt tgt aga gtt act ttc gtt caa gtt ttg tct 240
Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

aga cac ggt gct aga tac cca act tct tct gcg tct aag gct tac tct 288
Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
60 65 70

gct ttg att gaa gct att caa aag aac gct act gct ttc aag ggt aag 336
Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

tac gct ttc ttg aag act tac aat tac act ttg ggt gct gac gac ttg 384
Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

act cca ttc ggt gaa caa caa atg gtt aac tct ggt att aag ttc tac 432
Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

aga aga tac aag gct ttg gct aga aag att gtt cca ttc att aga gct 480
Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

tct ggt tct gac aga gtt att gct tct gcc gaa aag ttc att gaa ggt 528
Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

ttc caa tct gct aag ttg gct gac cca ggt gct aac cca cac caa gct 576
Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
155 160 165

tct cca gtt att aac gtt att att cca gaa ggt gct ggt tac aac aac 624
Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
170 175 180 185

act ttg gac cac ggt ttg tgt act gct ttc gaa gaa tct acc cta ggt 672
Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Thr Leu Gly
190 195 200

gac gac gtt gaa gct aac ttc act gct gtt ttc gct cca cca att aga 720
Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
205 210 215

gct aga ttg gaa gct cac ttg cca ggt gtt aac ttg act gac gaa gac 768
Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
220 225 230

gtt gtt aac ttg atg gac atg tgt cca ttc gac act gtt gct aga act 816
Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

tct gac gct act caa ttg tct cca ttc tgt gac ttg ttc act cac gac 864
Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
250 255 260 265

gaa tgg att caa tac gac tac ttg caa tct ttg ggt aag tac tac ggt 912
Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

tac ggt gct ggt aac cca ttg ggt cca gct caa ggt gtt ggt ttc gtt 960
Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
285 290 295

aac gaa ttg att gct aga ttg act cac tct cca gtt caa gac cac act 1008
Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

tct act aac cac act ttg gac tct aac cca gct act ttc cca ttg aac 1056
Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

gct act ttg tac gct gac ttc tct cac gac aac act atg gtt tct att 1104
Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile
330 335 340 345

ttc ttc gct ttg ggt ttg tac aac ggt act aag cca ttg tct act act 1152
Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

tct gtt gaa tct att gaa gaa act gac ggt tac tct gct tct tgg act 1200
Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

gtt cca ttc gct gct aga gct tac gtt gaa atg atg caa tgt gaa gct 1248
Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala
380 385 390

gaa aag gaa cca ttg gtt aga gtt ttg gtt aac gac aga gtt gtt cca 1296
Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

ttg cac ggt tgt gct gtt gac aag ttg ggt aga tgt aag aga gac gac 1344
Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

ttc gtt gaa ggt ttg tct ttc gct aga tct ggt ggt aac tgg gaa gaa 1392
Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
430 435 440

tgt ttc gct taa 1404
Cys Phe Ala



<210> 97 
<211> 467 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Consensus 
phytase-10-thermo-5-Q50T-K91A 

<400> 97 

Met Gly Val Phe Val Val Leu Leu Ser Ile Ala Thr Leu Phe Gly Ser
-20 -15 -10

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn Ser His Ser Cys Asp
-5 -1 1 5

Thr Val Asp Gly Gly Tyr Gln Cys Phe Pro Glu Ile Ser His Leu Trp
10 15 20 25

Gly Thr Tyr Ser Pro Phe Phe Ser Leu Ala Asp Glu Ser Ala Ile Ser
30 35 40

Pro Asp Val Pro Lys Gly Cys Arg Val Thr Phe Val Gln Val Leu Ser
45 50 55

Arg His Gly Ala Arg Tyr Pro Thr Ser Ser Ala Ser Lys Ala Tyr Ser
60 65 70

Ala Leu Ile Glu Ala Ile Gln Lys Asn Ala Thr Ala Phe Lys Gly Lys
75 80 85

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu
90 95 100 105

Thr Pro Phe Gly Glu Gln Gln Met Val Asn Ser Gly Ile Lys Phe Tyr
110 115 120

Arg Arg Tyr Lys Ala Leu Ala Arg Lys Ile Val Pro Phe Ile Arg Ala
125 130 135

Ser Gly Ser Asp Arg Val Ile Ala Ser Ala Glu Lys Phe Ile Glu Gly
140 145 150

Phe Gln Ser Ala Lys Leu Ala Asp Pro Gly Ala Asn Pro His Gln Ala
155 160 165

Ser Pro Val Ile Asn Val Ile Ile Pro Glu Gly Ala Gly Tyr Asn Asn
170 175 180 185

Thr Leu Asp His Gly Leu Cys Thr Ala Phe Glu Glu Ser Thr Leu Gly
190 195 200

Asp Asp Val Glu Ala Asn Phe Thr Ala Val Phe Ala Pro Pro Ile Arg
205 210 215

Ala Arg Leu Glu Ala His Leu Pro Gly Val Asn Leu Thr Asp Glu Asp
220 225 230

Val Val Asn Leu Met Asp Met Cys Pro Phe Asp Thr Val Ala Arg Thr
235 240 245

Ser Asp Ala Thr Gln Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp
250 255 260 265

Glu Trp Ile Gln Tyr Asp Tyr Leu Gln Ser Leu Gly Lys Tyr Tyr Gly
270 275 280

Tyr Gly Ala Gly Asn Pro Leu Gly Pro Ala Gln Gly Val Gly Phe Val
285 290 295

Asn Glu Leu Ile Ala Arg Leu Thr His Ser Pro Val Gln Asp His Thr
300 305 310

Ser Thr Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn
315 320 325

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Asn Thr Met Val Ser Ile
330 335 340 345

Phe Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr
350 355 360

Ser Val Glu Ser Ile Glu Glu Thr Asp Gly Tyr Ser Ala Ser Trp Thr
365 370 375

Val Pro Phe Ala Ala Arg Ala Tyr Val Glu Met Met Gln Cys Glu Ala
380 385 390

Glu Lys Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro
395 400 405

Leu His Gly Cys Ala Val Asp Lys Leu Gly Arg Cys Lys Arg Asp Asp
410 415 420 425

Phe Val Glu Gly Leu Ser Phe Ala Arg Ser Gly Gly Asn Trp Glu Glu
430 435 440

Cys Phe Ala

The search has been performed on the sequences as they are
described in the Sequence listing.